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CN ' Abstract 

Persistence theory discussed in this paper is an application of algebraic topology (Morse 
Theory) to Data Analysis, precisely to qualitative understanding of point cloud data. Math- 
£NJ 1 ematically a point cloud data is a finite metric space of a very large cardinality. It can 

be geometrized as a filtration of simplicial complexes and the homology changes of these 
complexes provide qualitative information about the data. There are new invariants which 
permit to describe the changes in homology (with coefficients in a fixed field) and these 
■ invariants are the "bar codes" . 

In Section [3] we develop additional methods for the calculation of bar codes and their 
refinements. When the coefficient field is Z2, the calculation of bar codes is done by ELZ 
algorithm (named after H. Edelsbrunner, D. Letscher, and A. Zomorodian). When the 
coefficient field is R, we propose an algorithm based on the Hodge decomposition. 
{Sj \ The original persistence theory involves the "sub level sets" of a nice continuous map 

(tame map). With Dan Burghelea and Tamal Dey we developed a persistence theory which 
. involves level sets discussed in Section 4. This is a refinement of the original persistence. The 

level persistence we propose is an alternative to Zigzag persistence considered by G. Carlsson 
and V. D. Silva. We introduce and discuss new computable invariants, the "relevant level 
persistence numbers" and the "positive and negative bar codes" , and explain how they are 
related to the known ones. 

We provide enhancements and modifications of ELZ algorithm to calculate such invariants 
^ ■ and illustrate them by examples. 

Sections 3 and Section 4 are preceded by background materials (Section [2} where the 
concepts of algebraic topology used in this paper are defined. 



1 Introduction 

We view "Persistence Theory" as the Computer Science friendly application of Morse Theory to 
Data Analysis. 

• Data considered in this thesis is so called PCD (point cloud data) which is mathematically 
a finite metric space (£, d) of very large cardinality. A PCD often appears as a collection of 

1 This paper is the Ph.D thesis written under the direction of D. Burghelea at OSU. 
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points in some Euclidean space R with d being the Euclidean distance. It is hard to visualize 
a PCD or to study its structure directly, since the cardinality is large and dimension of the 
Euclidean space in which it embeds can be more than three. However, we can understand 
qualitative features of a PCD by analyzing the family of simplicial complexes associated to 
it. We use Vietoris-Rips complexes in this thesis which were introduced first by Vietoris 
then by Rips in connection with the group theory. 

• Morse theory (28] is part of geometric and algebraic topology, whose purpose is to describe 
the topology of reasonably nice spaces X, for example smooth manifolds, with the help 
of nice real valued functions / : X — > R, for example proper smooth functions with all 
critical points nondegenerate. It uses the local changes in the homology of the sub levels 
X-oo,t — to describe the global topology (homology) of the space X. The 

topology of the sub levels for such functions changes for a discrete collection of t's, called 
critical values. If X is compact then the topology changes for a finite collection of t's, say 
to < t\ < • • • < and a finite filtration X_oo,to C • • • C X_oo jtjv = X is defined. 

Persistence theory, as considered by Edelsbrunner, Letscher, Zomorodian [21], P. Frosini and 
C. Landi [22], also anticipated in Rene Deheuvels [IE] , provides a slight change of perspective. 
The object to start with is a space X equipped with a finite filtration rather than a function 
/ : X — > R. This leads to the concepts of persistence vector spaces, linear algebra of persistence 
vector spaces and additional type of invariants, bar codes [6J , [37] . (As recognized by Carlsson and 
Zamorodian [37] the concept of bar codes corresponds to "torsion and rank" of a finitely generated 
graded module over the ring of polynomials of one variable. This concept was not previously used 
in the work of computer scientists.) 

When the underlying space X is a simplicial or more general polytopal complex, algorithms 
of reasonably low complexity [10], [30], [26] as proposed in [21], [37] can be used to calculate 
efficiently the bar codes of the persistence vector space associated with a filtration of the complex. 
From the bar codes, one can derive the homology of the sub levels and then of the underlying 
space. 

Relationship with Data analysis 

The Persistence Theory becomes relevant to Data Analysis due to a remarkable new idea: 
one can understand the qualitative features of a PCD by analyzing geometric shapes, in this 
case simplicial complexes, associated to it. One associates to the finite metric space (E, d) and 
e > a simplicial complex i? e (S) (so called "Vietoris-Rips complex"). Since a single Vietoris-Rips 
complex R e (E) does not contain enough information of (S, d), one considers all of these complexes 
for all e > 0. Fortunately only finitely many of them are different and they form a finite filtration 
of the standard simplex of dimension (jS — 1. The result of this analysis is encoded in bar codes, 
which carry information about the qualitative features of the data and signal the missing parts 
and "accidental noises" in the data. This theory had already nice applications in many areas of 
science cf. Carlsson |5] and one expects much more. 

Section [2] , Mathematical Preliminaries, provides a concise presentation of the mathematical 
concepts behind the Persistence Theory. In this Section we recall the definitions of simplicial 
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complexes, simplicial homology, polytopal complexes, cellular homology, singular homology and 
Betti numbers as well as the methods to calculate them when the coefficient field is Z 2 or R. 
When the field is Z 2 , the calculation is done by the ELZ algorithm [21J. 

When the field is R, the calculation is done by the Hodge decomposition. This is a new method, 
at least as long as the calculation of bar codes is concerned, which is based on the elementary 
Hodge theory [T8] . 

In Section[3], Persistent Homology of a PCD, one reviews the basic algebra of (tame) persistence 
vector spaces and introduces the bar codes of a tame persistence vector space. These are standard 
materials but our presentation is slightly different from the existing literatures. 

From a PCD (£, d) one derives a filtration of simplicial complexes (Vietoris-Rips complexes 
mentioned before) 

Then r-dimensional homology groups of this filtration with linear maps being induced by 
inclusions and coefficients on a field defines a tame persistence vector space. The bar codes of this 
tame persistence vector space is referred to as the bar codes of the PCD. 

When the coefficient field of homology groups is Z 2 , the calculation of bar codes is done by 
the ELZ algorithm. 

When the coefficient field of homology groups is R, the calculation of bar codes is done by 
a slight generalization of the Hodge decomposition as described in Section [2] . This is a new 
contribution. 

So called simultaneous persistence used in the subsequent work on level persistence ( Section 0] 
) is also a new contribution, which requires appropriate modifications / improvements of the ELZ 
algorithm. 

In Section 0] , Persistence Theory Refined, one considers continuous maps / : X — > R. When 
/ is weakly tame (Definition 13. lip , a finite filtration X„ oo to C X_oo <tl C • • • C X_ OQ j N is defined, 
where X^^^ = / _1 ((— oo, ti\) and t < t\ < ■ ■ ■ < tjq are critical values. The homology groups 
in each dimension of the above filtration provides a persistence vector space whose bar codes 
are referred to as the bar codes of the weakly tame map / in that dimension. In this Section 
the "persistence" for such filtration (which is the standard persistence considered in Section [3] 
) is referred to as the sub level persistence and its invariants as the bar codes for the sub level 
persistence of /. The sub level persistence considers the changes in the homology of sub levels. 

In this Section we refine the sub level persistence to level persistence. The level persistence 
considers the changes in the homology of the level sets X t = / _1 (t) rather than the changes in 
the homology of the sub level sets X_ 00tt = / _1 ((— oo, £]). In a more primitive form the level 
persistence was first considered in |17j under the name "interval persistence". For tame maps 
(Definition |4.ip the level persistence is equivalent to zigzag persistence previously introduced in 

Despite the concept of tame maps is less general, almost all continuous maps and in particular 
all maps of practical interest like simplicial maps or generic smooth maps on smooth manifolds 
are tame. Tame maps are weakly tame (Corollary 14.31) . 
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The level persistence determines and is determined by the relevant persistence numbers intro- 
duced in this Section. For a tame map the relevant persistence numbers are equivalent to the bar 
codes for the zigzag persistence, a concept introduced by Carlsson and Silva [11J. We call in this 
Section these bar codes the bar codes for level persistence and also provide their definition. 

For a tame map the relevant level persistence numbers consist of a finite collection of numbers 
while the bar codes for level persistence (equivalently for zigzag persistence) consist of a finite 
collection of intervals of four types: closed, open, left open right closed and left closed right open. 
They carry considerably more information than the bar codes for sub level persistence. In fact 
the former implies the latter as explained and illustrated by examples in Section H] . 

There are two fundamental concepts in the level persistence: death and detectability (or ob- 
servability). These concepts should be compared with birth and death in sub level persistence. 

We also introduce in this Section the concept of positive and negative bar codes since they 
are related to relevant numbers and can be calculated efficiently when the underlying space of the 
map is a simplicial complex and the map is linear on each simplex. 

We reduce the calculation of bar codes for level persistence or of relevant persistence numbers 
to the calculation of bar codes for sub level persistence. However the sub level persistence is 
not for the map / but for other maps associated with /, which are based on the construction of 
cuts along levels. For this purpose we provide new algorithms to calculate " cuts along levels" , of 
interest to computational geometry and improve on existing the algorithms (e.g ELZ). 

This Section also contains a number of examples and pictures to describe the implementation 
of the algorithms. 

The level persistence as presented in this thesis was also considered in papers [4] and [T7] . 
This paper is the Ph.D thesis written under the direction of D. Burghelea at OSU. 
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2 Mathematical Preliminaries 



In this section we review the definitions of simplicial complex / simplicial homology |25] / polytopal 
complex / cellular homology [29J and singular homology and the methods to calculate these ho- 
mologies when the coefficient field is Z 2 or R. 

2.1 Basic Knowledge 

Definition 2.1. Affinely Independent [H] 

A set of points (vectors) Xq, ■ ■ ■ , x%. G M. D is affinely independent iff the vectors 

X\ - X ,X 2 - Xq,-- - ,Xk~ X 

are linearly independent. 

Definition 2.2. Convex Hull [2] 

Let X = {xi, ■ ■ ■ , Xk} be a finite set of points(vectors) in M. D . The convex hull of X is the set 
of all convex combinations of the elements of X 

k k 

H convex (X) = { ^2 a * x i |ai6R,«i>0,^a» = l} 
i=i i=i 

Definition 2.3. Affine Hull [E] 

Let X = {xi, • • • , Xk} be a finite set of points (vectors) in M D . The affine hull of X is the set 
of all affine combinations of the elements of X 

k k 

aff (X) = { a * x i I «i e K, ^ «i = 1 } 
i=i i=i 

Definition 2.4. Geometric Simplexes in a Real Vector Space [31] 

An n-geometric simplex a in M. D forn<D is the convex hull of n + 1 affinely independent 
points t>o, • • ■ , f n £ K D and sometimes denoted by [vq, ■ ■ ■ ,v n ]. The points vq, ■ ■ ■ ,v n are called 
vertices of a. 

The convex hull of a subset of vertices of a containing m + 1 points is called an m-face of a. 
The m-faces of a are also m-geometric simplices. 

Definition 2.5. Standard n-Simplex [25] 

The standard n-simplex A n e M n+1 is the convex hull of vertices 

u = (l,0,--.,0),.-.,« ft =(0,...,0,l). 

Definition 2.6. Geometric Simplicial Complexes [31] 

A geometric simplicial complex K, = { a \ \ i e ^4} in M D is a collection of geometric simplices 
in lk° that satisfies the following conditions: 

1. Any face of a simplex of /C is also in K,. 

2. The intersection of any two simplices 0\ , a 2 G JC is a face of both o"i and <T2. 
2 The homology considered here is always in a fixed field. 
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Definition 2.7. Underlining Space of a Geometric Simplicial Complex [31] 

Let's denote the union of simplices of a geometric simplicial complex JC as |/C| and call it the 

underlying space of /C. 

Definition 2.8. Simplicial Maps Between Geometric Simplicial Complexes [31] 

Let /C and £ be two geometric simplicial complexes. A map / : \K\ — >■ \C\ is a simplicial map 
if it sends each simplex of K, to a simplex of £ by a linear map taking vertices to vertices. 

Definition 2.9. Abstract Simplicial Complexes [3T] 

An abstract simplicial complex X = (V, S) consists of a set V of vertices and a set S of finite 
subsets of V called abstract simplices, which satisfy the following property: if a G S and r C a, 
then r E S. 

If a G 5 has n + 1 elements, the dimension of a is n and cr is called an n-simplex. 
If r C a has m + 1 elements, r is called an m-face of a. 

Definition 2.10. Simplicial Maps Between Abstract Simplicial Complexes [31] 

Let X\ = (Vi,Si) and X 2 = (V^iSb) De two abstract simplicial complexes. A map / : 
V\ — > Vi is a simplicial map if whenever the vertices t>o, • • • ,f n of A\ span a simplex, the points 
f( v o), " ' ' j /( u n) are vertices of a simplex of A2. 

Definition 2.11. Spatial Realization of an Abstract Simplicial Complex [31] 

A spatial realization /C =< X > of an abstract simplicial complex X is provided by an injective 
map / : V — » M D that sends each simplex in to an affinely independent set in M. D . So that no 
interior of different geometric simplices intersect (in particular if f(vi) are all affinely independent). 
The union of the convex hulls of the images of all abstract simplices of X is a geometric simplicial 
complex, which is a spatial realization of X. 

Spatial realizations always exist for D large enough and their underlying spaces are all home- 
omorphic to each other. If we have a geometric simplicial complex /C, we can also get an abstract 
simplicial complex X with V the set of vertices of /C and 

S = { a C V I a are vertices of some simplex of /C }. 

Then (V,S) defines an abstract simplicial complex X. 

It is not hard to check the above assignments are essentially inverses to each other and they 
also induce a bijection between simplicial maps, so an abstract simplicial complex X and its spatial 
realization K, are equivalent objects. 

Definition 2.12. Oriented Simplex (page 26, [3~T] ) 

Let a be a simplex (geometric or abstract). One says that two orderings of its vertices are 
equivalent if they differ from one another by an even permutation. If a is a 0-simplex then, 
obviously, the ordering is unique. If dima > then the orderings of the vertices of a fall into two 
equivalence classes. Each of these classes is called an orientation of cr. An oriented simplex (cr, e) 
is a simplex a together with an orientation e. 

If the points Vq, • • • ,v p are affinely independent, we shall use the symbol 

(vq, ■■■ ,v p ) 
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to denote the oriented simplex consisting of the simplex {v , ■ ■ ■ ,v p } and the equivalence class of 
the ordering (v , ■ ■ ■ ,v p ). 

Definition 2.13. Simplicial Homology with Coefficients in a Field [25] 

Let X be an abstract simplicial complex as defined above. Let C^(X\ k) be the k- vector space 
generated by the set of n-simplices in X, where k is a fixed field. Elements of C^(X;k) are 
finite formal sums EjAjUj where Aj G k, <j\ are n-simplexes of X . By choosing a total ordering of 
vertices of X (which provides an orientation on each simplex) a boundary map : C^(X; n) — > 
C^Li(X] k) is defined as the linear extension of the map below 

i 

It's easy to verify that o d^ +1 = 0, so that we can define the simplicial homology groups as 
H*(A>)=ker^/im«9* +1 . 

Choosing different orderings on the vertices of X or different orientations on each simplex, the 
boundary maps might change (the sign of entries might change) but the homology groups remain 
the same. 

Definition 2.14. Convex Polytope [21] 

An extreme point of a convex set S in a real vector space is a point in S which does not lie in 
any open line segment joining two points of S. The extreme points of a simplex in a real vector 
space are its vertices. 

A convex polytope is the convex hull of a finite set of points. We call extreme points of a 
polytope its vertices. A simplex is a convex polytope such that its vertices are affinely independent. 

Suppose P is a convex polytope with X the set of its vertices. The dimension of P is the 
dimension of the affine hull of X. 

A point x G P is an interior point of P if it is interior in the sense of point set topology when 
P is regarded subspace of aff(X). 

Given a±, ■ ■ ■ ,ao,bEM, consider the closed half-space defined by 

{ (ri, • • • , r D ) G M D | aiT-i + • • • + a D r D < b } 
The boundary of the above half-space is 

{ (n, ■ ■ ■ , r D ) G R D | axn + ■■■ + a D r D = b} 

A face of a convex polytope is any intersection of the polytope with a closed half-space such 
that no interior points of the polytope lie on the boundary of the half-space. 

The dimension of a face is the dimension of its affine hull. Faces of a convex polytope are also 
convex polytopes. Vertices of a polytope are its 0-dimensional faces. 

Definition 2.15. Polytopal Complex [24J 

Polytopal complex, often named as polyhedra complex or cellular complex, consists of a collec- 
tion of convex polytopes in some Euclidean space M D , satisfying two conditions: 
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(i) Every face of a polytope in IT is also in IT. 

(ii) The intersection of any two polytopes in II is a face of both. 

The polytopes in II are called cells. An n- dimensional polytope in II is called an n-cell. 
For example, the set of all faces of a convex polytope defines a polytopal complex. 
A geometric simplicial complex is a polytopal complex in which every cell is a geometric 
simplex. 

Definition 2.16. Oriented Convex Polytope [29J 

Let a be a d- dimensional polytope in M. D and V(a) be the set of vertices of a. e : (^S) — > 
{1,-1} is an orientation of a if the following conditions are satisfied: 

(1) e(t>o, ■ ■ • , v[, • ■ ■ ,Vd) = e(vo, ■ ■ ■ , V{, ■ • ■ ,Vd) if v j and v[ are in the same open half-space of 
MP delimited by one of the supporting hyperplanes of Vo, ■ ■ ■ , V{, ■ ■ • ,v d (hyperp lanes that pass 
through v Q , ■ ■ ■ , Vi, ■ ■ ■ , v d ) and e(v Q , ■ ■ ■ , v' i} ■ ■ ■ , v d ) = -e(v , ■ ■ ■ , v h ■ ■ ■ , v d ) if not. 

(2) e(v Q , vt, ■ ■ ■ , v d ) = sign(7r)e(f 7r ( ), f,r(i), • • • , v n{d )) for every permutation ir. 
O-dimensional polytope admits a unique orientation. Every <i-dimensional(c/ > 1) polytope 

admits exactly two distinct orientations. If e is one of them, — e is the other one. 

An orientation e on a induces an orientation e\ T on its codimension-1 face r. The convention 
is that a base in the supporting hyperplane of r followed by a vector pointing towards the interior 
of cr provides a base of the supporting affine space of a which defines the orientation e. 

Let a be an n-cell, when r is a (n — l)-face of a, one defines the orientation e T by 



for any v , ■ ■ ■ , f n _i G V(r) and v n G V{a) \ V{r). 

Definition 2.17. Cellular Homology with Coefficients in a Field [29] 

Let IT be a polytopal complex as defined above. Let C n (IT; k) be the vector space generated 
by the set of n-cells in II with coefficients in a fixed field k. Elements of C n (IT; k) are finite formal 
sums SjAjfTj where \ G k and Oi are n-cells of II. 

For any cell a choose an orientation e{a). 

The boundary map d n : C n (II; k) — > C n _i(II; k) is defined as the linear extension of the map 
below 



where I a{r = if r is not a face of a and I a{r = ±1 if e(er)| r = ±e(r), e(a) and e(r) are orientations 
of a and r respectively. 

We can verify that d n o d n+ i = 0, so the polytopal homology groups are defined as T-L n (Jl; k) = 
kerd n / imd n+ i. ^^(11; k) is independent of the chosen orientation although the boundary maps 
d n do depend. 

When k = 1*2, orientation becomes irrelevant and a boundary map is defined by 



e|r(^o, • • • , 



v n _i) := e(v , ■ ■ ■ 




T 




where a is an n-cell and cr^'s are (n — l)-cells, A, = 1 iff o~i is a face of a. 
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Note. There is a canonical subdivision of a polytopal complex into a simplicial complex. Each 
cell a can be decomposed into a simplicial complex by induction. Choose an interior point x a 
(apex) of the cell a. The simplices of the simplicial complex a are all simplices of the faces of 
a plus the cones with base the simplices of the faces and with vertex x a . This decomposition is 
possible because all cells are convex polytopes. Once each cell is decomposed into a simplicial 
complex, we can regard the polytopal complex as the union of those simplicial complexes, hence 
simplicial complex. 

Once we decompose a polytopal complex into a simplicial complex, we can calculate the ho- 
mology groups of this simplicial complex. In fact we can use this simplicial homology to replace 
the homology of the polytopal complex, since the underlying spaces of the simplicial complex and 
polytopal complex are the same and two complexes with the same underlying spaces have the 
same homology groups (explained in the paragraph before Definition 12.18ft . 

It is easier to write programs to get the boundary maps and calculate the homology groups 
of a simplicial complex than a polytopal complex. However, it costs much more time to calculate 
the homology groups by first decomposing a polytopal complex into a simplicial complex than 
to calculate directly without decomposition, since the decomposition increases number of cells 
greatly. 

It turns out that the homology of a simplicial or polytopal complex depends only on the 
underlying topological space. The easiest way to see this is to define the homology for a topological 
space, independent of simplices or cells, and show that it leads to the same results as the one 
described above. The homology defined in this way is known as singular homology and is defined 
below. 

Definition 2.18. Singular Homology with Coefficients in a Field(see page 108, 153 [25] ) 
A singular n-simplex in a topological space X is defined as a continuous map a : A n — > X 
on the standard n-simplex (Definition 12 .5 j) . Let C n (X; k) be the vector space with basis the set 
of singular ra-simplices in X and coefficients in a fixed field k. Elements of C n (X; k) are finite 
formal sums SjAjCJj for Aj G G and (Tj : A n — y X. A boundary map d n : C n (X; k) — > C n _i(X; k) 
is defined by the formula: 



where a : A n — > X is a singular n-simplex, vo, ■ ■ ■ ,v n are vertices of A n , Vi means Vi is deleted. 
Implicit in this formula is the canonical identification of [vq, ■ ■ ■ , Vi, ■ ■ ■ ,v n ] with A n_1 , preserving 
the ordering of vertices, so that a\ [v , ■ ■ ■ , Dj, • • • ,v n ] is regarded as a map A" -1 — > X, that is, a 
singular (n — l)-simplex. 

It's easy to verify that d n o d n+ i = 0, so that we can define the singular homology group with 
coefficients in k as 7-L n (X; k) = ker<9 n / im<9 n+ i. 

T-L n (X; k) is a vector space since k is a field. 

Definition 2.19. Betti Numbers 

Given a field k one can define b n (X; k), the n-th Betti number with coefficients in k, as the 
dimension of the vector space T~i n (yX\ Kj. 
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We will show how to calculate Betti numbers of a polytopal complex II with Z 2 or K coefficients. 
When the coefficients are in Z 2 , we have the following method(j2n]) referred to below as the 
ELZ algorithm. 

First order all cells, so that i < j if a, is a face of <jj. Then form a boundary matrix d, so that 
the entry on z-th row and j-th column is 

„r. _ J 1 if <jj is a codimension-1 face of o~j\ 
L ' [0 otherwise. 

The matrix d is an upper triangular matrix with zeros on diagonal. Let low(j) be the row index 
of the lowest 1 in column j. If the entire column is zero, then low(j) is undefined. We call a 
matrix reduced if low(j) ^ low(jo) whenever j and jo, with j ^ j , specify two non-zero columns. 
The algorithm below starts with the boundary matix d and changes it by adding columns from 
left to right. All matrices in this process are upper triangular with zeros on diagonal. Finally we 
get a reduced matrix R. 

Algorithm 2.1. The ELZ Algorithm 

for j = 1 to m do 

while there exists jo < j with low(jo) = low(j) do 
add column j to column j 

endwhile 
endfor 

From the reduced matrix R we can read off the Betti numbers. It can be proven [20J that 
the zero columns of R correspond to generators of cycles and non-zero columns of R correspond 
to generators of boundaries. Since the homology group is quotient of cycles over boundaries, the 
Betti number is the number of generators of cycle minus the number of generators of boundary. 

Therefore the k-th Betti number 

bk(H; I12) = the number of zero columns which correspond to a fc-cycle 

—the number of non-zero columns which correspond to a (k + l)-boundary 

When the coefficients are in IR, we develop a different method using the Hodge decomposition 
introduced below. 

We will see that in Observation 12.71 that the k-th Betti number is 

6fe(II; E) = number of fc-cells in LT — rank(dk+\) — rank{dk). 



2.2 Hodge Decomposition of a Chain Complex with R Coefficients 

The constructions below follow the standard "Hodge decomposition" familiar in Riemannian ge- 
ometry. This finite dimensional elementary formulation was first considered by B. Eckmann |18j . 
One starts with a complex of finite dimensional M-vector spaces 

9r + 2 „ <9r+l „ d r n 9 r -i d 2 n <9i n do n _ „ 
• ■ ■ s- O r+ i >- U r ~" O r _i ■ ■ • >■ L-i *- L/q >■ U_i — U . 
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The spaces in this complex are equipped with positive definite inner products which make the 
standard basis orthonormal. Each complex comes from a finite simplicial / polytopal complex L, 
with C r the R- vector space generated by the r-simplices / r-cells of L and d r the boundary maps. 
d r+ i : C r+ \ C r are linear operators between inner product spaces. Let 5 r = d* +1 be the adjoint 
operator of d r+ \. 

Lemma 2.1. If A and B are two finite dimensional inner product spaces, f : A — )■ B is a linear 
map and f*:B^-Ais its adjoint, i.e. (b, f(a)) = (f*(b), a) for any a G A and b G B, then 

i) ker(f) = !////!/•)) •: 

ii) For any a G A, if f o f* o f(a) = then f(a) = 0. 

Define A r = d r+ i o S r + <5 r _i o d r : C r — > C r for r G Z> , (C r ) + = im((9 r+ i), (C r )_ = im(5 r _!) 
and H r = ker(A r ). 

Proposition 2.2. i) H r = ker(<5 r ) fl ker(d r ); 

ii) (Hodge Decomposition) C r = (C r ) + © H r © (C r )_, where (C r ) + , H r and (C r )_ are 
pairwise orthogonal; 

Hi) There is a canonical isomorphism 

j r : H r = ker(<5 r ) nker((9 r ) — > H r = ker (d r ) / im(d r+ \) . 

We will give an algorithm for calculating the Hodge decomposition, i.e., given a chain complex 
C(L), we will calculate the three orthogonal projections: 

(p r ) + : C r C r wzt/i (p r )+(C r ) = (C r ) + 

ijpr)- '■ C r — > C r with {p r )-{C r ) = (C r )_ 

and 

for each r > respectively. 

Lemma 2.3. Given any m x n matrix A over R ; i/ie ran/c of A is k then there exists a m x k 
matrix [A] of the form 

[A] = (vi_,v 2 , ■ ■ ■ ,v k ) mxk (2.1) 

where {v±,V2,--- ,v k } is a collection of orthonormal column vectors which is equivalent to the 
collection of column vectors of A. 

Two collections of vectors are equivalent if they generate the same subspace. Moreover, there is 
a canonical construction of such orthonormal column vectors known as Gram-Schmidt Orthonor- 
malization. 

Note. 1. Given a linear map A : IR n — > R m and view A as a m x n matrix with respect to the 
standard basis of lR n and R m , the collection of column vectors of [A] represents an orthonormal 
basis of im(A). 2. Matlab contains a function orth with input an m x n matrix A, output a 
matrix [A]. 3. [v4][v4] T is unique although [A] is not (See Lemma 12. 4p . 
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Lemma 2.4. Given a linear map A : W 1 — > M. m , the linear map 

Pa '■ M m -> M. m 
y^[A}[Afy 

is the orthogonal projection on im(A) . 

Proposition 2.5. Given a chain complex CiV) of a finite simplicial / polytopal complex L over . 

9r+2 „ <9r+l „ 8 r n 9r-l 92 n 9\ n 9q „ _ 

• • »- W + i >- U r O r _i 3- • • Oi Oq *~ L'-l — U 



Each d T can be regarded as an n r _i x n r matrix with respect to the standard basis of C r and G r -\. 
The following linear maps are orthogonal projections onto (C r ) +; (C r )_ and H r , respectively. 

(p r ) + : C r ->■ C r 

y H> [d r+ i}[d r+ i] T y 

(p r )_ : C r -»■ C r 

y ^[(9 r f][(5 r f] r |/ 

y ^ (i nr - [d r+1 ][d r+1 f - l(d r ) T }[(d r ) T } T )y 



The linear map 



k r : "H r = ker(9 r )/im((9 r . + i) H r — ker(<5 r ) D ker(<9 r 
y + im(<9 r+ i) h> (p r ) H (y) 



is the inverse of j r , which verifies Proposition 12.21 iii). 

Observation 2.6. The rank of a real matrix A equals to the rank of AA T or A 7 A. 

By the Hodge decomposition, dim(if r ) = dim(C r ) — dim((C r ) + ) — dim((C r )~). By the 
above observation, dim((C r ) + ) = rank([d r+ i] [d r+ i] T ) = rank([d r+ i]) = rank{d r+ \). We get 
dim((C r )~) = rank(d r ) similarly. 

Observation 2.7. The r-th Betti number 

b r (L; M) = dim(H r (L; K)) = dim(C r ) — rank(d r+ i) — rank(d r ). 
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3 Persistent Homology of a PCD 



3.1 Introduction 

Informally, we call a finite set of points X C R m a point cloud data (PCD for short). A PCD can 
be regarded as a finite metric space (see Definition 13. ip . 

A first new idea in Data Analysis is to regard a PCD as a filtration of the standard n-simplex 
A n (n = JjX — 1) via a construction known as "Vietoris-Rips complex" (see Subsection 13.21) . The 
first term of the filtration is the 0-skeleton of A™ or X itself, the last term is A n with the remaining 
components giving an idea of the qualitative features of the set X C R m . 

Another new idea is to use homology to describe the topological changes of the components of 
this filtration (topological features "are born" in some component, and "die" in some other com- 
ponent). This leads to the persistent homology of this filtered simplicial complex. The persistent 
homology provides the tools to measure and explain the qualitative patterns of a PCD. The role 
of the numerical invariants Betti numbers, when the homology of a space (simplicial / polytopal 
complex) is considered, is taken by the invariants bar codes, when the persistent homology of 
a filtered simplicial / polytopal complex is under consideration. The homology considered in this 
work is always with coefficients in a field k (k — Z 2 or R), so the homology groups are actually 
vector spaces. The linear algebra for persistent homology is "persistent linear algebra" discussed 
in Subsection 13.31 The invariant "dimension" for a vector space is replaced by the invariant "bar 
codes" for a persistence vector space. The bar codes provides a complete invariant for a persistence 
vector space (cf. Theorem 13. 5p as the dimension provides a complete invariant for a vector space. 
For a filtered simplicial / cell complex as opposed to simplicial / cell complex, we will have "bar 
codes" derived from persistent homology as opposed to Betti numbers derived from homology. 
The bar codes can be explicitly calculated by algorithms of the same complexity as the algorithms 
used for the calculation of the Betti numbers. For k = Z 2 , the matrix reduction and pairing 
algorithm [20] referred below as the ELZ algorithm is the one we will use. For k = R, we will use 
elementary Hodge theory to produce a new algorithm for the calculation of the bar codes. 

Subsection 13.21 discusses PCD and Vietoris-Rips filtration. Subsection 13.31 discusses persistent 
linear algebra, including persistence vector spaces and their bar codes. The essential features 
of a persistence vector space are the concept of "birth and death time" of its elements and 
the fact that the "bar codes" provides a complete invariant which carries significant information 
about "birth and death time" of its elements. Subsection 13.41 defines persistence vector space 
% r {K) and the bar codes B(fC) of a filtered simplicial / polytopal complex K and uses the Hodge 
decomposition to calculate B(fC). Subsection 13.51 gives algorithms to calculate the bar codes of a 
PCD with coefficients k = Z 2 or R. In case of the field Z 2 , the algorithm was introduced in [2Tj . 
Subsection 13.61 shows a numerical experiment of the bar codes of a PCD with coefficients k = R . 

3.2 PCD, Vietoris-Rips Complex and Filtration 
Example 3.1. A PCD in M 2 with 5 points (see Figure [1] below). 



3 Time here means the index of the component of the filtration. 
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R €a (X) 



Figure 1: A PCD in M 2 with 5 points 

Informally, we call a finite set of points X C M m a point cloud data (PCD for short). See 
example above. The inclusion X C ]R m defines a finite metric space (X, d) with d being the 
Euclidean distance in K m . From mathematical point of view we will use the following definition 

Definition 3.1. A point cloud data is a finite metric space X = (X, d). 

Let X be a finite metric space. Given e > 0, the Vietoris-Rips complex R e (X) of PCD X 
has X as the set of verticies. A fc-simplex is any subset of vertices a = [xo,Xi, . . . ,Xk] with the 
property that d(xi,Xj) < e for all pairs Xi,Xj G a. One obtains in this way an Abstract Simplicial 
Complex. 

Notice that Vietoris-Rips complex will be determined by its one skeleton. 
If e < e' then there is an inclusion R € (X) R e \X). 

Since a PCD X is a finite set, Vietoris-Rips complex R e (X) will change at only finitely many 
epsilons 

= e < ei < ■ • • < ejv = sup (d(x, x')). (3.0) 

x,x'ex 

We define the filtration of Vietoris-Rips complexes of PCD X as 

R eo (X)CR ei (X)C...CR eN (X). (3.1) 

In this filtration, R eo (X) is a zero dimensional simplicial complex with #(X) vertices and 
R tN (X) is a — 1 standard simplex. 

Example 3.2. Filtration of Vietoris-Rips complexes of the PCD in Example 13.11 
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The invariant we will calculate will be a collection of intervals for any r, < r < dimR eN (X). 
Each interval will have as ends 6q, ■ ■ ■ , ejv, +00 or equivalently 0, 1, • • • ,N, +00 with the convention 
that % indexes the number 6j. 

If we are interested in the bar codes for r < m — 1, one can work with R e (X, m), the m-skeleton 
of R e (X). The m- restricted filtered simplicial complex 

R eo {X,m) C R ei {X,m) C ... C R €N {X,m). (3.2) 

has the same persistent homology and bar codes as the original filtration (13. II) up to dimension 
m—1 (see subsection 13.41 for the definition of persistent homology and bar codes ). This supposes 
a much smaller amount of data to be stored in a computer and permits to calculate all bar codes 
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for r < m — 1 . 

Given a positive integer number P(< iV), we can make a further restriction of the filtration 
(13. 2 p by stopping at level P 



R eo (X, m) C i? £1 (X, m) C ■ ■ • C R tp (X, m) 



(3.3) 



This will permit the calculation of all bar codes for r < m — 1 with both ends less than P or 
the bar codes for r < m — 1 with left end less than P and right end > P. 

3.3 Persistent Linear Algebra (a gentle introduction) 
Definition 3.2. 1) A persistence vector space V is a sequence 

On, W : K, -> K+i I n e Z> } 

with vector spaces over a field k and y? n linear maps. 

A persistence vector space is tame iff each V n has finite dimension and ip n is an isomorphisms 
for n large enough. 

The main features of this concept are "birth and death time" Q for elements and information 
about these is provided by the bar codes. 

2) A linear map of persistence vector spaces u : V — > W, where V = {V n , <f n '-V n ^ V n+ i \ n 6 
Z> } and W = {W n , <fi n : W n —> W n+ i \ n G Z> } is a commutative diagram 



tfin-l 



Vn 



^~ * 71.- 



'n+1 — 



1 



where a; n is a linear map from to W n for each n > 0. 

A linear map of persistence vector spaces u : V — > W is an isomorphism if there exists another 
linear map of persistence vector spaces u' : W — )■ V such that u'oy : V -)• V and o u;' : W — > W 
are identities. V and W are isomorphic if there is an isomorphism between them. 

The existence of a linear map u : V — > W so that each component u n : V„ — > W n is an 
isomorphism for every n > implies w is an isomorphism. One takes u' n = (w n )~ l . 

3) Let U = {U n ,ip n : U n -> f/ n+ i | n G Z> }, V = {K,y2 n : 14 -> V n+X \ n G Z> } and 
W = {Wn, 0n : W n — >■ W n+ i | n G Z> } be persistence vector spaces. A short exact sequence of 
persistence vector spaces 







U 



V 







is a sequence of linear maps of persistence vector spaces such that 



U, 



fin 



is short exact for all n > 0. 



*Time here means the index of the component of the filtration. 
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The short exact sequence *■ U >- V — v —^ W >- splits if there exits a linear map 

a : V — ?■ U such that a o /i — identity or if there exists a linear map j3 : W — > V such that 
v o (3 — identity. 

Note. The alternative definitions are equivalent. Given a : V — > Li such that a o /j, — identity, 
we can define j3 : W — > V such that v o j3 — identity and vice versa. 

i)Let a : V — > U s.t. a o /i — identity. For any x G W n , there exists x' G such that 
v n {%') — x - Define 

x i — y x 1 — fi n o a: n (V). 
To see that /3 n is well defined, i.e. independent of the choice of x', consider 

7„ : V n ->U n @ W n 
x h-> (a n i, z/ n x). 

7„ is injective. Indeed if x G ker(7„) hence a n x = and v n x = 0. Since im/x n = kerz/ ra , there 
exists y £ U n such that x = fi n {y)- Then y = a n o fi n (y) = a n (x) = 0, so x = /x n (0) = 0. 

If x" G 14 such that v n {x") = x, then 7 n will map both x' — n n o at n {x') and x" — /x n o a n (x") 
to (0,x). Since 7„ is injective, x' — /i n o a n (x') = x" — /i n o a n (x") which shows that f3 n is well 
defined. 

It is straightforward that u n o (3 n = identity and the following diagram is commutative 



n+l 



/3n 



1 



n » n+l 

ii)Let j3 : W — >■ V with v o (3 = identity. For any x G \4, let y s = x — (3 n o v n (x), then 
v n (y x ) = and there exist unique z x G // n such that /i n (zx) = 
Define 

Oin- V n -¥ U n 

a n is well defined since for each x, y x and z x is unique. 

It is straightforward to check a n o /i n = identity and the following diagram is commutative 

V n > V n+ i 



On + l 



TT TT 

4) Direct sum of a finite collection of persistence vector spaces V 1 — {V£, tp l n : V£ — > V^ +1 \ n G 
Z> }, % G A(A finite), is defined by V = {V n , <p n :V n -> V n+1 \ n G Z> }, where V n = 

ieA ieA 

(p n = (J^y?n for n > 0. Note that a direct sum of a finite collection of tame persistence vector 
spaces is tame. 
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Observation 3.1. 1) Given a split short exact sequence 







-u 



V 







we have V = U © W, where U,V,W and fx, v are the same as in Definition ^. 2\ 3). 
2) Not any short exact sequence is split. 

Proof. 1) Since the two conditions for a short exact sequence to be split are equivalent, WLOG, 
we can assume that there exists a linear map (3 : W — > V such that v o f} — identity. Define the 
linear map 

5 n : U n © W n -»■ 14 
(y, z) h-)> + 

for all n > 0. 

It is easy to check the following diagram is commutative 

u n © W n ( ^U n+1 © w n+1 



v„ 



Given any element x6V„. Let z = ^ n (a;), then u n (x — (3 n z) = z — z = and there exist unique 
y EU n such that fi n fy) = x — (3 n z. 

Then 5 n (y, z) = ii n y + {3 n z = x — f3 n z + f3 n z = x and 5 n is surjective. 
Let (y, z) e ker 5 n , then + (3 n z = 0. 

Since = v n ([i n y + /3 n z) = v n o //„(?/) + z/ n o = + z = z, we have z = 0. 
Then /i n y = implies f/ = 0, since ii n is injective. 
Therefore (y, z) = (0, 0) and 5 n is injective. 
Hence 5 n is an isomorphism for all n > 0. 

Therefore 5 : W © W — > V is an isomorphism and V = U @ W. 
2) Counterexample: 

Consider the following short exact sequence: 



V 



I a 




where 



ii : K — >• K © K, p 2 : M © M — >■ R, c: K © M ^ M © 1 
ar i->- (ar, 0) (x,y)t-+y (x, y) H> (y, 0) 
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If the sequence splits, in view of ceo// = identity, a±(x, 0) = x, which makes the commutativity 
of the diagram 

TTT) 



ao 



ai 



impossible. 

□ 

Notation 3.3. Define the following tame persistence vector spaces as basic tame persistence 
vector spaces. 

1) K,[t] is the tame persistence vector space over a field k, 

On, (p n :V n -> V n+1 | n E Z> } 

where V n — k and tp n = identity for all n > 0. 
It corresponds to the interval [0, oo). 

2) S r K,[t] is the tame persistence vector space over a field k 

{V n , (fn'-V n ^ V n+ i | n e Z> } 

where V n = for < n < r , V n = k for n > r , = for < n < r and <p n = identity for 
n > r. 

It corresponds to the interval [r, oo). 

The notation S r indicates the right shift with r-units. 

3) T r+ ift[t] is the tame persistence vector space over a field k, 

{V n , (p n : V n ->■ K+i I n E Z> } 

where 14 = k for < n < r , V n = for n > r , ip n = identity for < n < r and ip n — for 
n > r. 

It corresponds to the interval [0, r\. 

The notation T r+1 indicates the truncation at level r + 1. 

4) STp+iK;^] is the tame persistence vector space over a field k 

On, ^n : K -> K+i | n G Z> } 

where V n — k for r < n < r + p and V n = otherwise; <^ n = identity for r < n < (r + p — 1) and 
</? n = otherwise. 

It corresponds to the interval [r,r + p]. 

Notation 3.4. For tame persistence vector space V = {V n , ip n : V n — > V n+ i \ n e Z> } 

1) Denote <^jj = y^'-i ' • •ov'i : K — * Vj f° r * < 3 an d V 9 *,* = identity : — )■ V^, with i,j e Z> . 

2) Denote f3(i,j) = dim(im(</?j ) j : — )■ V,)) with i < j £ Z> . 

Note (3(i,i) = dimVi and /3(i,j) = j3(i,j + 1) for j large enough. If </? n is an isomorphism for 
n > N, denote oo) = j3(i, m), where m is any integer larger than i and N . 
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en 



Definition 3.5. Define an order -< of all basic tame persistence vector spaces: 

1) S r K[t] -< S r 'K[t] if r < r'; 

2) S r T p+ in[t] -< S r T p ' + in[t] if r < r' or (r = r' and p < p'); 

3) S r T p+1 K[t] -< S r 'K[t]. 
Clearly, this order is a strict total order. 

Lemma 3.2. If V and W are two basic tame persistence vector spaces such that V -< W, £/i 
any map from V to W is trivial. 

Proof. We check situation 2) of Definition 13.51 first. 

Suppose V = S r T p+ in[t] and W = S r T p ' + in[t] and there is a linear map a; : V — > W. We want 
to show lo = 0. 

We have 2 cases: 

Case 1: r < r' 

In this case, w n = for all n < r or n > r + p, since = 0. 
For r < n < r + p, consider the following commutative diagram: 

id 

K >■ ft 



0— VW„ 



We have u n = u n o id = o u r = 0. 
Hence, a; = 0, if r < r'. 
Case 2: r — r' and p < p' 
Consider the commutative diagram: 



V+p' 



id 

K ft 



for r < n < r + p. 

The proofs of situations 1) and 3) of Definition 13.51 are similar. 



□ 



Proposition 3.3. Any tame persistence vector space V = {V n , tp n : V n — > V n+ i\n £ Z>o} over a 
/ieid k zs isomorphic to 

S"Mt] © ^T n . +1 «[t] 
i<j<p i<i<<; 

where p, q, r i; and n^ G Z> . 

5o, any iame persistence vector space can be decomposed in to a direct sum of a finite collection 
of basic tame persistence vector spaces. 



20 



Proof. Since V is a tame persistence vector space dim(V^) is finite for all n > and there exists 
N > such that y? n is an isomorphisms for n > N. 
Define L(V) = ^ dimK- 

0<n<JV 

We prove by induction on L(V). 

If L(V) = 0, then V = 0. We are done. 

If Liy) ^ 0, then V k ^ for some < fc < iV, with V t = if % < k. 
WLOG, suppose Vo ^ and choose a nonzero element i>o from Vo. 
Define f n = v?o,n(^o), n > 0. 

Case 1: v n ^ for < n < j and v n = for n > j, where j > 1. 
Note. We must have j < AT, otherwise v n ^ for all n > 0, a contradiction. 

Consider the following "short exact sequence" of tame persistence vector spaces 



W K 




aj-i 



->■ K ■ 



Ctj-2 a j-l 
K 







fj-2 



<Pj-l 



h 



7-2 



fi-1 



9 j-2 



fj-2 



Vj-i 



<fij-l 



I hj fj + i 







V; 



■3 + 1 
<Pj + l 



V Vo/ KVq >- Vi/ KVi 9 Vj-2/ >" Vj-l/ KVj-i > Vj 



9j-i 



•fij-2 



9j 
<Pj-l 



9j+l 



fj 



V j+ i 



fj+1 



where 

a n = identity for < n < j — 2 and a n — for n > j — 1; 

/ n : /c ->■ K /or < n < j - 1 

A I—)- Af „ 
and /„ = for n > j; 

9n- V n -> V n / KV n for < n < j - 1 

X H-» X + „ 

and g n = identity for n > j; 

p n : 14/ «u n ->■ K+i/ «u n+ i /or < n < j - 1 

x + /tf„ !->■ v? n (x) + /tw„+i 
and = </?„ for n > j. 

The above diagram is commutative and is a short exact sequence of tame persistence vector 
spaces. Next we will show the above short exact sequence splits. 
Extend Vj-i to a basis {xi = Vj-i, x 2 , ■ ■ ■ , x p } of Vj-i. 
Given any element x G Vj-i, we have a unique expression 



x 



E 



QjjXi 



Define a linear map 



hj-i : Vj-\ — > k 

Y7i=l a i X i ^ a l 
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For < n < j — 1, define a linear map 



h n : V n -> k 

X H->- hj-l O (/7 nJ _i(x) 

For n > j, define h n = 0. 

Clearly h : V — > W is well defined and ho f = identity. 
Therefore the above short exact sequence splits, and 

= T jK [t] © V 

Hence L(V') = L{V) - j < L{V). 
Case 2: t> n 7^ for all n > 0. 

Consider the following "short exact sequence" of tame persistence vector spaces 




a N -2 



■ K ■ 



ojv-i 



■ K 



• K 



cw+i 



/jv-i 



<PN-2 



h 



JV-l 



Sjv-i 



I h N 



/iVH 



V l/ / *■ V\ j KVi >■ • • s- 

where a n = identity , 



9n 



fN+l 



9N+1 



VjV-l/ K^jV-l > V/jv/ KUjy — *- VjV+l/ K v N+ i 



VJV+i 



/„ : : V n ->V n /KV n 

A 1 — y Xv n x I—?- x + n 

y? n : V n /KV n ->■ K+l/^n+l 

x + /w„ (-)■ v?„(x) + kw„ + i 
for all n > 0. 

We will show the above short exact sequence splits. 
Extend vn to a basis {yi = f at, 2/2, •' ' > 2/?} °f Viv- 
Given any element y G Vjy, we have a unique expression 

q 

3=1 

Define a linear map 

fiN '■ Vn — > k 
For < n < N, define a linear map 

x H> h N o (p n>N (x) 
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For n > N, define a linear map 

h n . 

x^h N o ip^-Jx) 



Clearly h n o f n = identity and 

K ■ 



h„ 



V n — V n+ i 

is commutative for all n > 0. 

Therefore the short exact sequence in Case 2 splits, and 

= «[t] ®v 

Hence L(V') = L(V) - (JV + 1) < L(V). 



In both cases L(V) < L(V). 
By induction on L(V), 



S ri K[t] © 5^T n . +1 K[t] 

l<j<s l<j<t 



Proposition 3.4. Let 



and 



□ 



v= s^f]© ^T n . +1 K[t] 

V' = S<k[*] © S m ^T n , +1 K[t] 
l<i<p' l<j<q' 

IfV = V , then p = p' , q = q' and r% = r\, rrij = m'p Uj = n'j after a suitable permutation. 

Proof. Reorder components in V and V in increasing order (See Definition 13. 5p and group all 
copies of the same basic tame persistence vector space together into isotypical components, so we 
have 

w = w» 

l<n<a 

and 

w = W' n 

l<n<b 

Precisely each isotypical component of W or W' is a direct sum of isomorphic basic tame 
persistence vector spaces. 

We only need to prove that a = b and W n = W for 1 < n < a = b. 
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Define S(W) = a, the cardinality of the isotypical components of W. We will prove the 
statement by induction on S(W). 
If S(W) = 0, clearly W = = W. 
If S{W) > 0, write W = Wi © R and W = Pi © P', 

Where P = © 2 <n<«^n, A = ©l<n<6iW^, # = ®b 1+ l<n<bW^ 

bi is an integer such that W' n < W\ for 1 < n < b\ and W' n > W\ for n > b\. 
Here the order of W n and W' n is determined by the order of their basic components. 
Since W — W, there is a pair of isomorphisms u> : W — > W and u/ : W — » W such that 
uj o u>' = id and w'oy = zd. 

Write w : W W matrix 

VKi P 
Pi [AC 



R \ B D 



R \ D 
Pi P' 

w l ( A' a 



Since any component of W\ < any component of R, by Lemma 13.21 B = 0. 
So matrix form of a; is 

Wi R 
Pi (AC 



Similarly, uj' has matrix form 



R \ D' 
Since u o u' = id and w' o u = id, we have 

D ) ( P' ) = 7 and ( D' ) ( P ) = L 

So AA' = J, A A = J, DD' = I, D'D = I. 
Then W x = Pi and R ^ P'. 

Since Wi and Pi are isomorphic, each basic component of Pi must be isomorphic to the basic 
component of Wi. Their number in Wi and Pi should be the same. 
So Wi = Pi. 

Since R = R and S(R) = a — 1 < a, by induction we finish the proof. 

□ 

Definition 3.6. Bar codes is a finite collection of intervals 

M 

with i G Z>o J € Z> U{°°} an d * — J- 

Given a tame persistence vector space V, there exist a decomposition 

V ^ S r *K[t\® S^T nj+l K[t] 
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by Proposition 13.31 

Then assign bar code 

B(V) = {[n, oo], [m^rrij + n 3 -]|l < i < p, 1 < j < q} 

to V. Call it the bar code of the tame persistence vector space V. 
This bar code B(V) is unique by Proposition 13.41 

Theorem 3.5. Two tame persistence vector spaces are isomorphic iff their bar codes are the same. 

Proof. Theorem 13.51 is obtained directly from Proposition 13.31 and Proposition 13.41 □ 

Observation 3.6. j3(i,j)=number of intervals in B(V) which contain for i 6 %>o,j £ 
Z>o U{°°} an d i — 3- I n particular, dim(Vi) =number of intervals in B(V) which contain {i} 
for i > 0. 



Proof. Suppose 



v s sr*K[t] © s m iT nj+1 K[t\ 4 w 



Since V = W, there exist an isomorphism / : V — > W and following commutative diagram: 



W„ 



V, 



n+l 



w n+1 



/n + l 



Since /j : im(<pij : — > V}) — > im( 



'1,3 



: Wi —¥ Wj) is an isomorphism, (3(i,j) = dim(ira( 



Observe that 0jj is a direct sum of linear maps tz %d > k or >k or k ^0. (3(i,j) 



dim(im0j J ) is the number of linear maps k ld > k . Each k ia > k corresponds to an interval in 
the bar code B(W) that contains 

Therefore 0(i,j) = the number of intervals in bar code B(V) that contains [i,j] 

□ 

Definition 3.7. Given a tame persistence vector space V, define fi{i,j) = number of intervals in 
B(V) which equal to 



id 



Observation 3.7. 1) J% 



- 0(i - 1, j) - + 1) + 0(i - 1, j + 1) < % < j < oo 

P(0,j)-P(0,j + 1) i = 0,0< j<oo 

oo) — /3(i — 1, oo) < z < oo,j = oo 

/3(0, oo) i = 0, j = oo 



i<i, m>j 



fj,(l, m) 
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Proof. 1) In the case < % < j < oo, we have 

rthi)= #{reB(V) \r=[i,j]} 

= #{r G B(V) | r D - #{r G B(V) | r D [i - 

-#{r G B(V) | r D [i, j + 1]} + #{r G B(V) | r 2 [i - 1, j + 1]} 
= p(i,j) - (3{i - 1, j) - /3(i, j + 1) + - 1, j + 1) 

The second identity in the above identities holds because 

{reB(V)|r=[i,j]} 
= {re B(V) | r D - {r G B(V) | r D [i - 1, j]} - {r G B(V) | r D [i, j + 1]} 

The other three cases are easier to prove. 
2) Follows directly from definition. 

□ 

Definition 3.8. Denote k(i,j) = dim(ker(<^jj : Vi — » Vj)), k(i) = dim(Vi) with % < j G Z> . 
Observation 3.8. 1) 

j) = k(i) - k(i,j). 

From the above observations, we see that the set of numbers {(3(i,j)} and {k(i,j), k{i)} 

are equivalent. Sometimes, it is more convenient to define and calculate bar code using the num- 
bers k(i,j) and k(i) instead of (3(i,j). 

Edelsbrunner-Letscher-Zomorodian's Interpretation [21] 

Let V be a tame persistence vector space (see Definition 13. 2p . 

One says that the nonzero element x G Vj ; is born in Vi, i < j hence "is born at time i" if it is 
in the image of (pij and not in the image of (pi-ij and dies in V^, k > j + 1, hence "dies at time 
j" if y^-fctz) = but (p j>k -i 0. 

A subset S C V, V vector space, is called linearly independent if its elements form a collection 
of linearly independent vectors in V. If 5^ C Vi is linearly independent in V, and (pij(Si) is linearly 
independent in Vj, then we call Si linearly independent on interval 

With this in mind, (3(i, j) is the maximal cardinality of linearly independent sets on the interval 
and = the maximal cardinality of linearly independent sets on the interval whose 

elements are born in Vi and die in Vj+i- 

Each interval in B(V) guarantee the existence of an element in Vi, which is born in V,, 
survives in each V r (i < r < j) and dies in Vj+i. 

In [21] the authors propose to collect as points in the extended half space HR : = 

{(x,y) GM 2 Ui?xoo|s< y}. With this convention Cohen-Steiner, Edelsbruner, Harer [13] 
have established a strong stability result associated with a real valued map(cf. Stability Theorem, 
page 182 in [20]) which relates the distance between two real valued functions and the bottle neck 
distance of their persistence diagrams. 

Zomorodian - Carlsson's Interpretation [37] 
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Definition 3.9. 1) A «;[t]-module V is the vector space over k equipped with a linear map 
A : V — > V. The module action is defined as 

K[t] xV 

2) A fi;[t]-module V with a linear map A : V — > V is finitely generated if there exists 
{v i, t>2, ■ ■ • , f r } such that any u G V is a linear combination of 

{vi, v 2 , ■ ■ ■ , v r , A( Vl ), A(v 2 ), • • ■ , A(v r ), A 2 ^), A 2 (v 2 ), • • ■ , A 2 (v r ), ■■■}. 

3) A graded «[t]-module V is a /c[i]-module together with a decomposition of the vector space 
V = V n and a linear map A : V — > V such that A(v n ) G Ki+i, Vt> n G V^. 

n>0 

4) Let V = with the linear map A : V — > V and W = W n with the linear map 

n>0 n>0 

£> : V — > V be two graded modules. 

A morphism of graded «[i]-modules / : V — > W is a linear map of vector spaces such that 
f(V n )CW n and fo A = Bof. 

A morphism of graded K[t]-module / : V — > W is an isomorphism if there exists another 
morphism of graded rc[£]-module g : W —> V such that g o f : V — > V and / o g : W — > W are 
identities. V and W are isomorphic if there is an isomorphism between them. 

One of the main results of [37J is the equivalence of the category of finitely generated «[£]- 
modules and the category of tame persistence vector spaces. 

Proposition 3.9. Finitely generated graded n[t]-modules identify to tame persistence vector spaces 
and so do their morphisms. 

Proof. 1) Let V = V n be a finitely generated /^-module with a linear map A : V — » V such 

n>0 

that A{v n ) G Vn+u \/v n G V n . 

There exist {vi, v 2 , ■ ■ ■ ,v r } such that any v G V is a linear combination of 

(*) {vi, «2, • • • , v r , A( Vl ), A(v 2 ), ■ • • , A(v r ), A 2 ( Vl ), A 2 (v 2 ), ■■■ , A 2 (v r ), ■■■}. 

Since each Vi is a sum of homogeneous components, WLOG, we can assume {v\,v 2 , • ■ • ,v r } 
themselves are homogeneous, i.e., i>j G V ni , 1 < i < r. 

Let Hi be the degree of i>j , 1 < i < r. WLOG, we can assume rij < n i+ i. 
Let ft = A\ Vn : V n ->■ K+i, then 

V = {K, ft ■ V n -> Kj +1 |n G Z> } 

is a persistence vector space. 

To see that V is tame we denote by iVo = max n.j. Clearly dim V^(n < iVo) is finite and in view 

l<i<r 

of (*) the linear maps ft )n ,+i: — ft+fc-i ° ft+fc-2 ° • ■ • ° ft are surjective hence dim V n is finite for 



27 



any n > N . Since tp n is surjective for any n > N and dimVAr is finite, there exists A" so that 
dim V n is constant for n > N. Since any surjective map between vector spaces of the same finite 
dimension is an isomorphism, the linear map ip n is an isomorphism for n > N. 
Hence, V is a tame persistence vector space. 

If V = {V n , ip n : V n —> V n+ i | n G Z> } is a tame persistence vector space, define the graded 
fc[£]-module V = V n with linear map A = ip n : V — > V. 

n>0 n>0 

There exist A^ > such that ip n is an isomorphism for n > 0. Let {vi,v 2 , • • • ,v r } be the set 
of all generators of V\, V 2 , ■ ■ ■ , Vat, then any v G V is a linear combination of 

{ui, v 2 , ■ ■ ■ , v r , A( Vl ), A{v 2 ), ■■■ , A(v r ), A 2 { Vl ), A 2 {v 2 ), ■■■ , A 2 {v r ), ■■■}. 



2) From definitions of morphisms of finitely generated graded «;[t]-modules and linear maps of 
tame persistence vector spaces, we can see that two finitely graded K[t]-modules are isomorphic 
iff the tame persistence vector spaces associated with them are isomorphic. 

□ 

Recall that the ring is a principal ideal domain and a basic theorem in algebra [27] claims 
that any finitely generated modules over a principal ideal domain decompose uniquely. In partic- 
ular, any finitely generated modules over K[t] decomposes uniquely as a finite direct sum of k[£]'s 
and T d .re[t]'s. As noticed by [37], the above result extends to finitely generated graded modules 
where the free module n[t] is to be replaced by S ,r fi;[t] for some r and torsion module T d n\t] by 
S r TdK.[t] for some r and d. Notice that the module S r A has the component (S r A) p = A p ^ r for 
p > r and equal to zero for p < r. Note that each component S' r K[t] corresponds to a bar code 
[r, 00) and each component S r T d+1 n[t] corresponds to a bar code [r, r + d](cf. [37]). 

Quiver Representation Perspective 

As noticed by G. Carlsson and Vin de Silva [UJ, persistence vector spaces which stabilize for 
n > N can be regarded as representation of the oriented graph 

•1 *- # 2 " # 3 • " *n-i ^ *N 

Any such representation is a sum of indecomposable representations which are classified by the 
intervals ,1 < i < j < N. The interval with j < N correspond to bar code while 
the interval [i,N] to the bar code [i,oo). The interval corresponds to the representation 

0i"i r\ iid %d %d r\ 
1 ^ U2 ■ ■ ■ Uj_i >■ Ki >■ K i+ i • ■ ■ 9- Kj Uj + i ■ ■ ■ 

This is a result in the theory of quiver representation due to P. Gabriel[23j. 
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3.4 Persistent Homology and Bar Codes of a Filtered Simplicial/Polytopal 
Complex 

Definition 3.10. 1) A filtered space X consists of a space A and a finite filtration 

X CX l C---CX N = X. (*) 

2) Sometimes it is convenient to suppose that X is embedded in Aoo which is contractible. 
For a filtration without A^, we can complete it with A^ = C(X), the cone over X. Then (*) 
becomes 

l ciiC..-ci w = ici oo . (**) 

Example 3.3. Filtrations of Vietoris-Rips complexes (13.11) - (13.31) associated to a PCD X with 
N +1 points in R m provide examples of filtered simplicial complexes where I ro = R €N (X) = A n . 

Below are two other relevant examples. 

Definition 3.11. A continuous map / : X — > R is called weakly tame if X is compact and 
there exists finitely many values min(/(X)) — to < t\ < ■ ■ ■ < tjsf — max(/(A)) (so called critical 
values) so that: 

(i) for any t the closed sublevel X_ OQ t is a deformation retract of an open neighborhood, 

(ii) for any i and t, t 6 [tjti+i), X_ OCj ^ t retracts by deformation to X_ 0Oiti . 

Informally this means that each sublevel is homotopically well behaved (neighborhood retract) 
and the topology(homotopy type) of sublevels change only for finitely many £'s. 

Example 3.4. A weakly tame map / : X — » R with critical values to < ii < • ■ • < tjv provides 
another example with A 7 , = / _1 ((— oo, £*]), <i<N, X^ = CXn, where CXn is the cone with 
base Xat. 

The properties (i) and (ii) are sufficient hypotheses to ensure that in each dimension the 
homology vector spaces X t . provide tame persistence vector spaces. 

Example 3.5. There is a natural filtration of an A-dimensional simplicial complex or polytopal 
complex, the skeleton filtration 

A C X 1 C ■ • • C X N = X C X°° 
where X'(0 < i < N) are i-skeletons of A and A°° = CX. 

This subsection defines and studies persistent homology and bar codes of filtered spaces of 
finite simplicial/polytopal complexes. 

Given a filtered space JC of a finite simplicial/polytopal complex K 

K C K x C • • • C K N = K. (3.4) 

we can associate a commutative diagram defined below. 
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Definition 3.12. Denote by the K-vector space C r (K s ) with basis the r-simplexes / r-cells of 
K s , d^. : C s r — > Cy_j the boundary map from C r {K s ) to C r -x(K a ), i s r : C s r —> C^ +1 the linear map 
induced by the inclusion from K s to K s+ i ( clearly i s r is one to one ). Let C s r = C^, <9* = and 
i s r = identity when s > N, one obtains the commutative diagram 



aO 

U M+1 



"m 



J M+l 



c 



M 



qN 
U M+1 



■N+l 



J r+1 



a 



V-l 



°r + l 



ct 



i"- 1 

V-l 



a* 

W-l 



°r-l 



°r + l 



qN + 1 
u r+l 



■N-l 
V-l 



of 

°r-l S" 3 "" °r-l 
— 1 



V-l 



•JV+1 



6f +1 

■N+l 
N+l r- 1 



aiV+1 
°r— 1 



(3.5) 



9? 



^0° 



'(i 







*0 



at 



o 



.JV-l 
*0 



9f 



a 



,'iV 

AT *o 



giV+1 

• AT+l 

r<N+i l o 



JV+1 











Note. Since if is finite, each row is a tame persistence vector space and = for r > dim(K). 

Definition 3.13. 1) Passing to homology with coefficient in a field k, i.e. consider 'W T = 
ker(d*)/im(<9^ +1 ), i s r induces the linear maps % s r \'K s T ^f l-i s r +l . For each r(> 0) 

8+1 \ 



n r (JC) := {n 8 r X ■ K -> K |« G Z >o) 



is a persistence vector space with bar code B(H r (JC)). 



-s,s+p 



: Hi — > 1-Lt +P ) is referred to as 



Following [21] . the collection of vector spaces = im(2 r 
the persistent homology. 

2) The collection of bar codes B(T-L r (JC)) for all r is denoted by B(JC) and referred to as the 
bar code of /C. 



Suppose K, is a filtered simplicial / polytopal complex as in H3.4[) 

C ATi C • ■ ■ C = AT, 
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we calculate the bar codes of K in two = Z 2 and k = R. We consider only filtered 

simplicial complex below, filtered polytopal complex can be dealt with similarly. 

Case k = Z 2 

In this case, we can calculate the bar codes of K. using the persistent algorithm given in [20] . 
There are two steps of this algorithm: 1) matrix reduction; 2) pairing. 
Matrix reduction (algorithm (ELZ)) 

Suppose S(K) is the set of all simplices of K and #S(K) = m. 

Consider the function f in( i : S(K) — > {0, 1, • • ■ , N}, such that fi n d(o~) = i for a G Ki \ K^\. 

Choose a compatible ordering of the simplices, that is, an ordering of all simplices of K, such 
that a -< r if fi n d{o~) < fmd(T) or if a is a face of r. After we order all simplices of K according 
to this compatible ordering, we get a sequence of simplices a%, <7 2 , • - • , o~ m . 

Consider the m-by-m boundary matrix d given by 

., _ J 1 if (Tj is a codimension— 1 face of cr,; 
L J |^ otherwise. 

Let low(j) be the row index of the lowest 1 in column j. If the entire column is zero, then 
low(j) is undefined. We call R reduced if low(j) ^ low (jo) whenever j and j , with j ^ j , specify 
two non-zero columns. The algorithm reduces d by adding columns from left to right. 

Algorithm 3.1 

for j = 1 to m do 

while there exists j < j with low (jo) = low(j) do 
add column j to column j 

endwhile 
endfor. 

The running time is at most cubic in the number of simplices [26], [30]. In matrix notation, 
the algorithm computes the reduced matrix as R = d ■ V . Since each simplex is preceded by its 
proper faces, d is upper triangular. Since we only add from left to right, V is also upper triangular 
and so is R (See page 152-153, |20j for details). 

Pairing 

Once there are two cases of columns of reduced matrix R. 

Case 1: column j of R is zero. We call o~j positive since it creates a new cycle and thus gives 
birth to a new homology class unless it dies in the same filtration. 

Case 2: column j or R is non-zero. We call o~j negative because it provides the death to a 
homology class. 

See page 154-155, [20] for details. 

We can read bar codes directly from the matrix R. Suppose column j is zero and dim(cjj) = r. 
If there exists a column k with low(k) = j and find(°~k) > find(°~j) then column j provides a 
closed interval [find(o~j), find(o~k) — 1] £ B(H r (K)). If there exists a column k with low(k) = j and 
find(o~k) = find(o~j) then column j doesn't provides any interval since it indicates a cycle which 
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dies in the same nitration. Note that fi n d{.&k) < find{&j) is impossible according to the definition 
of compatible ordering. If there is no column k with low(k) = j then column j provides an infinite 
interval [f ind (aj),oo] G B(W r (fC)) . 

Case k = R 

For each s consider the chain complex C(K S ) with coefficients in R 

9 r + 2 9 r + l 8 s dj-l So 8? dn 

• " " Ly r i =*" Ly r _^ " • • Ly-^ *~ Oq — U 

equipped with the scalar products defined by the standard basis(see subsection 12. 2p . 
Let 

(1) 5*_ x = (d?.)* the adjoint operator of <9; for r G Z> , 

(2) A* = d s r+l o S s r + 8^ o d s r : C s r -> C r s for r G Z> and 

(3) (C;)+ = im(^ +1 ), (C r s )_ = im(^_ 1 ) and ii r s = ker(A^), with 

if r s = ker(5 r s ) n ker(<9 r s ) 

and 

c; = (c s r ) + © if; © (c;)_ 

where (C*)+, if; and (C;)_ are pairwise orthogonal. 

Recall that [A] represents the orthonormalization of matrix A(see Lemma [2.3p . With respect 
to the standard basis provided by simplices / cells, <9; can be regarded as an nf._ l x n s r matrix. 
In view of the above considerations, the following linear maps are orthogonal projections onto 
(C*)+,(C*)_ and if;, respectively. 

(#)+: (■■; -+c; 

y h- [<9; + iP; +1 ] T y 
UC) ■ c° >(■■; 

y mrmrvv 

(p s r ) H : C s r ^C s r 

y ^ (i K - [d° r+1 w r+1 ] T - mrmmv 



(3.6) 



H s r = ker(8 s r ) n ker(«9 r s ) H s r = ker(«9;) /im(<9 r s +1 ) 
x i — y x ~t~ im(<9; +1 ) 



and 



k 8 r : = ker(<9 r s )/im(d; +1 ) -> ii r s = ker(5 r s ) n ker(^) 
y + im((9 r s +1 ) i-). (p s r ) H (y) 
is a pair of isomorphisms between ff; and 

Since ffj identifies to W r by the pair of isomorphisms f r and k s T , the persistence vector space 

H r {K) = {H°,g s r : H S T -> ff r s+1 | s G Z> }, 

where #; = fc; +1 oijo j;, is isomorphic to 

H r {K) = {niX ■ K ->• ^r +1 I S G Z> } 

By Theorem EH one has B{H r {K)) = B{U r {K,)). 
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Notation 3.14. 1) Denote 



g^ = g t-l ... og s. H s^ H t 



for s < t and 



5V 



s,s 



identity : H* -> H* 



with s, t G Z> . 

Define i*'* : — >■ C* and i*'* : "H^ — >■ "H* in similar way. 

2) Denote /? r (s,t) = dim(im(^'* : ->■ if*)) with s < t G Z> . 

Note. /3 r (s,s) = dim# r s and /3 r (s,t) = f3 r (s,t + l) for i > N. 
Hence (3 r (s, oo) = /3 r (s, m) for m larger than s and iV. 

Notice that (3 r (s, t)=number of intervals in B(H r (JC)) which contain [s,t] for < s < t < oo. 
(See Observation 13. 6p 

3) Denote /x r (s, t)=number of intervals in B(H r (KL)) which equal to [s,t] for < s < t < oo. 
We have 



Note. From the above formula and note in 2) we only need to know /3 r (s, t) for all < s < t < N 
in order to calculate fi r (s,t) and the bar code B(H r (fC)). 

Theorem 3.10. j3 r (s, s) = rank({p s r ) H ) and (3 r (s,t) = ranfc((p*)# o ip f o (p s r ) H ) for s <t, where 



For the definition of (p s r )H, refer to \2. 1\) and $3. 6\) . 

We will use M ATLAB and in order to avoid the use of the function "rank" which is not reliable 
for large matrices we will need the following observations to complement Theorem 13.101 

Observation 3.11. The rank of a real matrix A equals to the number of positive eigenvalues of 
AA T or A T A. 

Observation 3.12. j3 r (s, s) = dim(H^) = dim(C^) — rank(d s r+1 ) — rank(d^.). 

Observation 3.13. If j3 r (s,t) ^ 0, s < t, then 

i) H l r is nonzero for all s < i < t; 

ii) r (s, i) is nonzero for all s < i < t . 

Discussion of the Simultaneous Persistence 




/3 r (s, t) - f3 r ( S - 1, t) - r (s, t + 1) + f3 r {s - 1, t + 1) 



< s < t < oo 
s = 0,0<t<oo 
0<s<oo,t = oo 
s = 0, t = oo 



(3.7) 
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In the case k = Z 2 we describe how to calculate the bar codes of Z 2 -homology groups of the 
filtered simplicial complex 

K CK 1 C-..CK N = K. 

Let's discuss a more general case where the filtration has two directions. 

Suppose X is a polytopal complex and X ± C X are two subcomplexes with X = X + fl X~ . 
Suppose X + and X~ are equipped with finite filtration Xq" C X^ C • • • C X^ + and Xq C X-f C 
• • • C X~_ where Xq = X , X~_ = X~ and X+ + = X + . Define the simultaneous persistence 
number u r (s, t) of the above filtration to be the maximal number of linearly independent elements 
in H r (X ) which dies in X~ at s and in X + at t(See [I] for details). 

One can put the cells of X in three groups I, II, III. The group I contains the cells of Xq, the 
group II the cells of X~ \ X and the group III the cells of X + \ Xq. 

Consider the following filtration of X 



Xq C Xi C • • • C X 



N-+N+ 



X 



+ N + . 



+ l<i<N- 
m. 

- + N + }, such that f(a) 



for a G 



where X = X" for < % < N~, Xi = X~ U XJ_ N _ for N~ 

Suppose S(X) is the set of all cells of X and #S(X) = 

We can define a function f in d : S(X) — > {0, 1, • • • ,N~ 
Xi \ Xj_i. 

We use a compatible ordering of the cells, that is, an ordering of all cells of X, such that a -< r 
if find{.°~) < findij) or if a is a face of r. After we order all cells of X according to this compatible 
ordering, we get a sequence of cells U\, cr 2 , • • • , cr m . 

With this compatible ordering, the incidence matrix M is given by 



M 



A B- C + 
5 
C 



A 


B- 





B 



with A being the incidence matrix for the subcomplex Xo, M 

being the incidence matrix for X 



being the incidence 



matrix for X and M + 



A C + 
C 

The matrix M is said to be in relative reduced form if both M~ and M + are in reduced form. 
This relative reduced form R re i can be achieved by applying Algorithm 2.4.1 to matrix M. 



We can read the number u>. 
the number of all triples 
low(k') = low{k") = k, o~k 



,j) directly from the relative reduced form R rei of M. u r (i,j) 



0V, <7fc» 

eX-\Xr 



) with <Tfc G Xq, column k of R re i is zero, dim(o"jt; 
4 and o k „ e X+\X+. 
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3.5 Calculate the Bar Codes of a PCD 



Given a point cloud data X = {x±, x 2 , ■ ■ ■ , x p } G R n (suppose all points in X are distinct), we will 
first consider its distance matrix 

/ d(xi,xi) d(x\,x 2 ) ■■■ d(xi,x p ) \ 
d(x 2 ,x 1 ) d(x 2 ,x 2 ) ■■■ d(x 2 ,x p ) 

d(^Xpj d(^Xpy '^-'2) • • • diyOc rp ^ x J 

Notice that the D is a symmetric matrix, the diagonal elements of D are zeros and all the other 
elements are positive real numbers. 

Let S be the set of the entries in D and order these numbers increasingly (since they will be 
our epsilons), so we have 

£:0 = e <e 1 <e 2 <---<e7v. 

£ is exactly (13. 01) . 

Given a positive integer m and P, let's compute the bar code of PCD X up to 
dimension m — 1 and step P(for definition see (13.31) ) . 

To simplify the notation, from now on we will write Ki instead of R €i (X,m). Kq is just a 
collection of vertices {x\, x 2 , ■ ■ ■ , x p }. 

We will abbreviate the vertex Xi simply by the number i , the edge [sc^,^] by [io,«i] and 
fc-simplex [x io , ■ ■ ■ ,x ik ] is represented by [i ,ii, - ■ ■ , i^]. 

We want first to have the information about the filtered simplicial complex stored as 

(a) An (m + 1) x (P+ 1) matrix named Dimension where the entry Dimension r+l s+1 is the 
integer n s r = dimension of C s r (See Definition 13.121) . 

(b) An (m + 1) array of matrices named Simplex. The component Simplex r+1 is a matrix 
describing r-simplices of Kp\ precisely is a matrix of r + 1 columns, each row representing an 
r-simplex [io,«i, • • • ,ik]- These rows are ordered consistently with the filtration. The order for 
simplices of K s \K s _i is however arbitrary. 

There is an obvious way to create these data, which is not very efficient but worth to mention. 
Given K s , we can easily find its 1-simplices from the distance matrix D. All the nonzero numbers 
in D which are less than e s correspond to the edges in K s . In order to find the r-simplices, we scan 
all possible combination of r + 1 numbers of {1, 2, ■ • • ,p}. [i , i 1 , ■ ■ ■ , i r ] (i < i x < ■ ■ ■ < z r )belongs 
to the r-simplices of K s iff every pair belongs to the 1-simplices of K s . As s increases, we 

have more r-simplices in K s . Store each rsimplex as a row in Simplex r+1 in filtration order. In 
another word, if [zo,«i, • • • , v] belongs to K s but not in then it is "new" and can be added 

as a new row to the rows provided by the r-simplices of K s _i. 

We use however the package JPlex [lj [34J to compute Dimension and Simplex , which can 
be download from http: / / comptop.stanford.edu/programs/jp lex/| 

Once we get Simplex, we get the standard basis of C^, which is stored in Simplex r+1 , for 
< r < m. Given Simplex,, and Simplex r+1 , we compute <9 T P with respect to the standard basis: 
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Write Simplex r+1 as 

( On ' ' ' a l,r+l ^ 

\ a nP,l ■ ■ ■ Q<,r+1 J 

and Simplex,, as 

/ Oil ■ • • h,r \ 

where n^ 3 and can be obtained from Dimension. 

is an /i^_i x n p matrix with element (d^^ij equals to the incidence number of the simplices 
[6^1, • • • , bi >r ] and [a^i, • • • , % jr+ i]. If [b it i, • • • , b^ r ] is not contained in [a^i, • • • , aj tr+ i], (d^)ij = 0. 
Otherwise, [b^i, • • • , b^ r ] equals to [a^i, • • • , a^, • • • , a J>r+ i] for some k, where means deletion. 
If k is odd, (dr)ij — 1; if k is even, {d^)ij = — 1. 
(9^ is the n£_! x upper-left block of 9^. 

Once Dimension and Simplex are determined, we apply Theorem 13.101 and the Observations 
13.111 13.121 and [3T31 to get (3 r (s,t) and then )U r (s, t) according to (13. 7p . 

Here are the brief description of functions, all of them are written in Matlab( for details see my 
program). Go to http: //www. math, ohio-state.edu/~du. 50, download ' 'Computing Bar code of PCD.z 
Then unzip it and put the folder in Matlab direction. 

Initial Data: 

A p x n matrix X, which stores p points x\, ■ ■ ■ , x p in IR n as rows, two positive integers m 
and S. m is the given upper bound of dimension of restricted Vietoris-Rips Complex. S is the 
given upper bound of steps of filtration. P = min(S, N). See (13. 3p . 

1. function getDistance 
Input: X. 

Output: A p-by-p upper triangular matrix D. Dy = d(xj, Xj), 1 < i < p — l,i + 1 < j < p. 

2. function scaleX 
Input: X, D. 
Output: X. 

Given a point cloud data X, we find all the distances between two points and sort them as 
e < ei < ■ ■ ■ < cat. Note: If e i+ i — Cj is very small(less than 3 • diam(X) • eps, the upper 
bound of round off errors, where diam(X) is the maximal distance between any two points of X, 
eps = 2.2204 x 10~ 16 is floating-point relative accuracy in Matlab), we delete £j + i. We scale X 
such that the minimum difference between two consecutive distances is larger than 3 x 10 -4 . For 
details and more explanation, read the comments in my program. 

3. function getEpsilon 
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Input: D, S. 

Output: Two positive integers P = min(S,N) and e = tp. Two row vectors epsiorg, 
which stores different distances eo < t\ < ■ ■ ■ < of D in increasing order and epsiavg, which 

stores 

e o + e i e fc + e fc+i £n-i + £n 1 
2 '"' ' 2 '"' ' 2 ' 6iV 2" 



4. function getDimensionSimplex 
Input: X, P, epsiavg, m and e. 
Output: 

a) An integer m a = dim(R ep (X, m)). 

b) An (m a + 1) x (P + 1) matrix Dimension. Dimension r+1 s+1 = n s r) the dimension of 

c;. 

c) An m a + 1 array Simplex. 

The component Simplex r+1 is a matrix with r + 1 columns, which stores r-simplex of 
R tp (X, m) as rows in filtration order. 

Note: Here we use the package edu. Stanford. math. plex.* of J P lex [T] [34]. 

5. function getDeltaP 

Input: Dimension, Simplex and m a . 

Output: An m a array DeltaP. DeltaP r = the matrix of : — > C^_ x with respect to 
the the standard basis of and C^_ v 

6. function getDelta 

Input: DeltaP, Dimension, r, s 

Output: The matrix d^. : C s r — > C^_ 1 with respect to standard basis of C s r and C^_ l . 

7. function getRank 
Input: P, m, and m a . 

Output: An (m + 1) x (P + 1) matrix R. R r+1 s+1 = rank of d^,. 

Algorithm: R r+l s+1 = number of eigenvalues of d* x d s r ' which is greater than 10~ 8 . Here 
we use Observation 13. 1 ll 

8. function getDimensionHrs 
Input: Dimension, R, P, m and m a . 

Output: An integer m c and an (m c + 1) x (P + 1) matrix dimHrs. dimHrs r+l s+1 = 
dim(H°). 

Algorithm: dimHrs r+ i s+ i = Dimension r+ i s+ i — R r+ 2,s+i — Rr+i,s+i- 
Here we use use Observation 13.121 



9. function getHarmonicProjection 
Input: Dimension, r and s. 
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Output: The harmonic projection matrix (pp) u from C s r to itself with respect to the standard 
basis of Cp 

Algorithm: See ( 13. 6p . 

10. function getBeta 

Input: Dimension, P, dimHrs and m c . m c is the maximal dimension of nonzero H^s. 
Output: A (P + 1) x (P + 1) x (m c + 1) matrix Beta. Beta(s + 1, t + 1, r + 1) = /3 r (s, t). 
Algorithm: 

/ dim(#°) dim( J ff 1 ) • • • dim(# p ) \ 
_ )= dim^ 1 ) ••• dim(ffjf) 

\ dim(# p ) J 

(3 r (s,s) = rank({j> s r )H) = dim (i/;!) (see Observation 13 . 1 2p : 
(3 r (s,t) = rank^pDx o ip* o {p s t )h) for s < t (see Theorem 13. 101) . 
Apply Observation 13.111 when computing rank as in function getRank. 
Apply Observation 13.131 to avoid unnecessary computing. 

11. function getMu 
Input: Beta, P and m c . 

Output: A (P + 1) x (P + 1) x (m c + 1) matrix Mu. Mu(s + 1, t + 1, r + 1) = /i r (s, t) 
Algorithm: Apply (13.71) . 

12. function getBar codeMatrix 
Input: Mu, P, and m c . 

Output: A, a q x 4 matrix which stores the bar code in increasing order according to 
dimension, left endpoint and right endpoint. M intervals [a, b] in bar code B(7i r (}C)) will be 
represented by [r a b M] in A. 

13. drawBar code 
Input: A and P. 

Output: A picture of bar code stored in barcode.eps. 

14. function main 
Input: X, S and m. 

Output: A matrix A stored in A. mat and a picture of bar code stored in barcode.eps. 
Algorithm: Run above functions except 6. and 9. in order. 

3.6 Numerical Experiment 

We have a PCD in M 2 with 53 points: 

The restricted filtered simplicial complex of this PCD ( m = 4, P — 35) is 
And bar codes of the above filtered simplicial complex are 
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R eo (X) 



• • • 



Figure 3: A PCD in R 2 with 53 points 



39 




Figure 4: Restricted filtered simplicial complex of this PCD (m = 4, P — 35) 
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Figure 5: Bar codes of the above filtered simplicial complex 
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4 Persistence Theory Refined 

In this section we consider refinements of standard persistence for a real valued map and describe 
the calculation if its invariants in the case of a linear map / : X — > R, X a simplicial complex. In 
order to understand the novelty we will review in subsection 14.11 the standard persistence named 
in this section sub level persistence. 

Let / : X — > R be a continuous map, denote by 

X^ t :=r\{-oo,t]) 

X t>0O := /-*([*, oo)) 
X t := r\t) and 
X att :=f-\[s,t]),8<t 

4.1 Sub Level Persistence [3 J 

Given a continuous map / : X — > R, the sublevel persistent homology introduced in [21] and 
further developed in [37] is concerned with the following questions: 

Ql. Does the class x G HfiX-^t) originate in H r {X- <Xi fii') for t" < P. Does the class 
x G Hf^X-ooj) vanish in H r {X_ 00it t) for t < t'l 

Q2. What are the smallest t' and t" such that this happens? 

The information that is contained in the linear maps Hf^X-^t) — > iJ r (X„ 00jt /) for any t < t' 
is known as sub level persistence. 

Let / x G H^X-ooj)). One says that 

(i) x is born at t", t" < t, if x is contained in img(ii r (X_ 00)t //)) — >■ // r (X_ 00)t ))) but is not 
contained in img(if r (X_ oo t //_ e )) — >■ i? r .(X_ 00jt ))) for any e > 0. 

(ii) x dies at if > t, if its image is zero in img(if r (X_ oo t )) — > H r (X_ 00tt >))) but is nonzero 
in img(F r (X_ OC)t )) ->■ i^pr^^))) for any < e < f - i. 

(iii) x dies at oo, if its image is always nonzero in img(H r (X_ oo t )) — > H r (X_ OCJ)t i))) for any 
t' > t. 

The standard construction "telescope" in homotopy theory permits to replace any finite filtered 
space Kq C K \ C • • • C by a weakly tame map / : X — > R (cf. Corollary 13. llj) . simply by 
taking X = K x [t ,ti] i^i x [ti, t 2 ] U0 2 ••• i^jv-i X [t N -i,t N ] K N where 0; : 

^ x -> x is the inclusion and /|jf jX [t i ,t j+1 ] the projection of x [t^, on 

\Pi, ti+l]- 

When / is weakly tame, the sublevel persistence for each r = 0, 1, ■ • • , dimX is determined by 
a finite collection of invariants referred to as bar codes for sublevel persistence [37J. The bar 
codes for sublevel persistence are a collection of intervals of the form [t,t') or [t, oo) with t < t'. 

The number of r-bar codes which identify to the interval [t, t') is the maximal number of 
linearly independent homology classes in H r (X_ 00 ^ t ), which are born at t, die at t' and remain 
independent in img(H r (X_ oo t ) — > H^X-m,*)) f° r an Y s, t < s < t' . 

The number of r-bar codes which identify to the interval [t, oo) is the maximal number of 
linearly independent homology classes in iJ r (X„ oo t ) which are born at t, never die and remain 
independent in img(if r (X_ 00jt ) — >■ /f r (X_oo )S )) for any s > t. 
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Lemma 4.1. The bar codes for sublevel persistence are finite collections of intervals of the form 
[ti,tj) or [t i} oo) with ti < tj being the critical values of f . 

Proof. Indeed the ends of the "bar codes" have to be critical values. Suppose there is an r-bar 
code of the form [t, t'), we have to < t < t' < t^. Suppose U <t < t i+ i for some i and tj <t'< tj + \ 
for some j. 

There exists a class x G H^X^^^) that is born at t and dies at t' . 

Since the canonical inclusion X_oo jt . •— y X^^t is a deformation retraction and induces an 
isomorphism on homology groups, we have x G img(if r .(X_ 00jti ) — y Hr^X-ooj)). If t > ti, it will 
cause a contradiction with the fact that x is born at t. So we have t = ti and the bar code is of 
the form [tj, t'). 

Consider the commutative diagram below 

X— oo, ti *" X— oo,tj 



oo.tf 



Since the canonical inclusion X_ OQ <t . ^-y X_ 00tt > is a deformation retraction, the vertical map 
in the above diagram an isomorphism. The image of x in img(H r (X_ 00tt .) — y H r (X^ 0Ojt:j )) is zero. 
If t' > tj, it will cause a contradiction with the fact that x dies at t' . So we have t! = tj and the 
bar code is of the form [ti,tj). 

For the same reason, an r-bar code [t, oo) is of the form [t i; oo). The finiteness of the collection 
of r-bar codes follows from the finiteness of dim.H r (X^ 00it ). 

□ 

From these bar codes one can derive the Betti numbers r (t, t'), the dimension of img(iif r (X_ 00)t ) 
H r {X_ oo t i)),ioi any t < t' and get the answers to questions Ql and Q2. For example, 

P r (t,t') = the number of r-bar codes which contain the interval [t, t'}. 

From the Betti numbers (3 r (t, t')one can also derive these r-bar codes. 

Denote // r (tj, i/)=number of r-bar codes which equal to [U,tj) for to < ti < tj < oo, where to 
is the smallest critical value. We have 

r (ti, tj-i) - Pr{U-l, tj-l) - Pr(U, tj) + Pr(U-l, tj) t < U < tj < OO 

, , , ) (3r(to, tj-l) ~ Prito, tj) U = to, t < tj < OO , , 

/,; ' ! ' ~ ^ /3 r (ti,00) -^(ti_!,00) t <U < OO, tj = OO 1 > 

/3 r (to, OO) U = t , tj = OO 

The computation of the bar codes for a filtration of simplicial / polytopal complex is discussed 
in subsection 13.41 with coefficient field of homology groups being Z 2 or M. 
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4.2 Level Persistence [3] 

Level persistence was first considered in [17] and was better understood when the zigzag persistence 
was introduced and formulated in [12]. Given a continuous map / : X — y R, level persistence is 
concerned with the homology of the fibers H r (X t ) and addresses questions of the following type. 

Ql. Does the image of x £ H r (X t ) vanish in H r (X ty t>), where t' > t or in H r (X t »j), where 
t" < tl 

Q2. Can x be detected in H r (X t /) where t! > t or in H r (X t ») where t" < tl The precise 
meaning of detection is explained below. 

Q3. What are the smallest t' and t" for the answers to Ql and Q2 to be affirmative? 
To answer such questions one has to record information about the following maps: 

H r (X t ) ^ H r {Xt, v ) <- H r {X v ). 

The level persistence is the information provided by this collection of vector spaces and linear 
maps considered above for all t, t'. 
Let / c 6 H r (X t ). One says that 

(i) c dies downward at t' < t, if its image is zero in img(H r (X t ) — > H r (Xt',t)) but is nonzero in 
ivag(H r (X t ) -»■ H r (X t ' +e ,t)) for any < e < t — f. 

(ii) c dies upward at t" > t, if its image is zero in img(H r (X t ) — >■ H r (X t ^n)) but is nonzero in 
img(H r (X t ) H r (X tjt r,_ e )) for any < e < t" - t. 

We say that x £ H r (X t ) can be detected at tf > t, if its image in H r (X tjt /) is nonzero and is 
contained in the image of H r (X t >) — > H r (X t)t t). Similarly, the detection of x can be defined for 
t" < t also. 

Definition 4.1. A continuous map / : X — > R is called tame (Definition 3.5, [4J) if X is compact 
and there exists finitely many values min(/(X)) = to < ti < ■ ■ ■ < = max(/(X)) (so called 
critical values) so that: 

(i) for any t ^ to, ti, • ■ ■ , t^ there exists e > so that / : f^it — e, t + e) — » (t — e, £ + e) and 
the second factor projection X t x (i — e, t + e) — >■ (£ — e, t + e) are fiber-wise homotopy equivalent. 

(ii) for any t^ there exists e > so that canonical inclusions X t . <^-> X t . t . +(E and X tj X t ._ e j. 
are deformation retractions. 

This means that the homology group of the level set changes at finitely many i's. We have 

Lemma 4.2. Given t^\ < s < t{, X t ._ l S and X Siti retracts by deformation onto X t ._ 1 and X t . 
respectively. 

Proof. We will prove that X S)ti retracts by deformation onto X ti _ 1 . The proof that X s>ti retracts 
by deformation onto X ti is essentially the same. 

For any a, < a < s, there exists an e a > such that / : J" 1 (a — e a , a + e a ) — > (a — e a ,a + e a ) 
and the second factor projection X t x (a — e , a + e a ) — >■ (a — e a , a + e a ) are fiberwise homotopy 
equivalent. Hence for any a — e a < t" < t! < a + e a , we have X t ".t' retract by deformation onto 
Xfi. For a, ti-\ < a < s we can choose < e a < min(a — £j_i, s — a), so that (a — e a , a + e a ) is 
contained in (ij_x,s). 
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For ti-i, there exists an e ti _ x > so that canonical inclusion X t . •— > X titti+et , is a deformation 
retraction. 

The union of open intervals (a — e a , a + e a ) for t^i < a < s covers the closed interval s]. 
Then finitely many of them (a,k,bk), 1 < k < m cover [ij_i,s]. Suppose these intervals are not 
contained one in another, otherwise we can delete it. Observe that (U-i — eti-nU-i + et i _ 1 ) and 
(s — e s , s + e s ) have to be contained in those finite collection of intervals. 

Order (a,k,bk), 1 < k < m according to their right endpoints increasingly, we have bj < bk, 
when j < k. Observe that b^-i G (afc,6fc) and G (afc_i,6fc_i). Therefore we also have Oj < a^, 
when j < k and the intersection of (a k _i,b k _i) an d (a k ,b k ) is always nonempty. Since both left 
endpoints and right endpoints are ordered increasingly (ai,&i) = (U-i — e t ._ 1 ,t i _ 1 + e tj _J and 
(a m ,6 m ) = (s - e s ,s + e s ). 

Choose any c fc G (a fe ,6 fe ) fl (a fc+ i,& fc +i), 2 < k < m - 1. Let c = Ci = + e <i _ 1 and 

We have X Ck _ uCk retracts by deformation onto X Ck _ 1 for any 1 < k < m. 
Hence X ti l)S = X COiCm retracts by deformation onto X tl _ x = X CQ . 

□ 

Corollary 4.3. Tame maps are weakly tame. 

Note that the tame maps form a generic set of maps in the set of all continuous maps on a 
simplicial complex or on a smooth manifold (and more general on a compact ANR). 

Lemma 4.4. Given tj_x < s < t < t i} X s>t retracts by deformation onto X s or X t . 

In case of a tame map the collection of the vector spaces and linear maps is determined up 
to coherent isomorphisms by a collection of invariants called bar codes for level persistence 
which are intervals of the form [t,t f ] with t <t' and (t,t f ), (t,t'\, [t,t') with t < t' . 

These bar codes are called invariants because two tame maps / : X — > H. and g : Y — > K. which 
are fiber-wise homotopy equivalent have the same associated bar codes. The above result was 
established for the zigzag persistence from which can be derived, but can also be proven directly. 
However the proof is not contained in this paper. 

An open end of an interval signifies the death of a homology class at that end (left or right) 
whereas a closed end signifies that a homology class cannot be detected beyond this level (left or 
right). 

There exists an r-bar code (£",£') if there exists a class x G H r (X t ) for some t" < t < t' which 
is detectable for t" < s < t' and dies at t" and t' . The multiplicity of (t", t') is the maximal number 
of linearly independent classes in H r (X t ) such that: 

(i) all remain linearly independent in img(H r (X t ) — > H r (X tyS )) for t < s < t' and img(H r (X t ) — > 
H r (X S:t ))fort"<s<t; 

(ii) all die at t" and t'. 

Notice that the change of t above will not affect the multiplicity of (t",t'). 

There exists an r-bar code (t", t'] if there exists an element x G H r (X t i) which is not detectable 
for s > t' and detectable for t" < s < t' and dies at t" . The multiplicity of (t",t'\ is the maximal 
number of linearly independent elements in H r (X t >) such that 
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(i) neither one is detectable for s > t'; 

(ii) all remain linearly independent in img(H r (X t /) — > H r (X s j>)) for t" < s < t'\ 

(iii) all dies at t" . 

There exists an r-bar code [t", f) if there exists an element x G H r (X t ») which is not detectable 
for s < t" and detectable for t" < s < t' and dies at t' . The multiplicity of [t",t') is the maximal 
number of linearly independent elements in H r (X t ") such that 

(i) neither one is detectable for s < t"; 

(ii) all remain linearly independent in img(H r (X t ") — > H r (X t >', s )) for t" < s < t'; 

(iii) all dies at t'. 

There exists an r-bar code [t", t'] if there exists an element x G H r (X t n) which is not detectable 
for s < t" or s > t' and detectable for t" < s < t'. The multiplicity of [t", t'} is the maximal number 
of linearly independent elements in H r (X t n ) such that 

(i) neither one is detectable for s < t" or s > t'; 

(ii) all remain linearly independent in img(H r (X t //) — > H r {X t n s )) for t" < s < t'. 

Lemma 4.5. In case of a tame map, the bar codes for level persistence are finite collections of 
intervals of the form [U,tj\ with U < tj critical values and (ti,tj), {U,tj\, [ti,tj) with ti < tj critical 
values. 

Proof. Using Lemma 14.21 and Lemma I4.4[ the proof is similar to Lemma 14.11 

□ 

Notation 4.2. Given a tame map / : X — > M. with critical values to < • ■ ■ < t^, denote by 

BL r (f) := the bar codes for level persistence with respect to r-th homology groups. 
N r (ti,tj) := the number of intervals {ti,tj) in BL r (f). 
N r (ti,tj] := the number of intervals (t{,tj] in BL r (f). 
N r [ti,tj) := the number of intervals [ti,tj) in BL r (f). 
N r [ti, tj] := the number of intervals [ti,tj] in BL r (f). 

Hence #BL r (f) = N^U,^) + N r {t h t 3 ] + N r [ti, tj) + N r [t h tj}. 

In Figure El we indicate the bar codes both for sublevel and level persistence for some simple 
map in order to illustrate their differences and connections. The class consisting of the sum of 
two circles at level t is not detected on the right, but is detected at all levels on the left up to (but 
not including) the level t! . 

Level persistence provides considerably more information than the sublevel persistence [1] 
and the bar codes for the sublevel persistence can be recovered from the bar codes for the level 
persistence. An r-bar code [s, t) for level persistence contributes an r-bar code [s, t) for sublevel 
persistence. An r-bar code [s,t] for level persistence contributes an r-bar code [s, oo) for sublevel 
persistence, r-bar codes [s,t] and {s,t) for level persistence contribute nothing to r-bar codes for 
sublevel persistence. An r-bar codes (s, t) for level persistence contributes an r + 1-bar code [t, oo) 
for sublevel persistence. See Figure |6] and Lemma [4761 below. 
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Figure 6: Bar codes for level and sub level persistence 
Lemma 4.6. Given a tame map f : X — > R with critical values to < t\ < ■ ■ ■ < tj$. We have 

N 



Hr(ti, OO) = ^N r [t u ti 



l=i 



for any critical values ti < tj . 



The bar codes for the level persistence can be also recovered from the bar codes for the sub 
level persistence but from the bar codes of a collections of tame maps canonically associated to /. 
This will be described in the next subsection. 

Definition 4.3. For a continuous map / : X -> E and t" < t < t', let L r (t) := H r (X t ), L+(t; f) := 
ker(H r (X t ) -> H r (X ti1f )), L;(t;t") := ker(H r (X t ) -> H r (X t „, t )) and I r (t,f) := img(H r (X t ) 
H r (X t>t ,)) nimg(H r {X tl ) -» H r {X tjt ,)). 

Define the relevant level persistent numbers 

l r (t) := dimL r (t) 

l+(t;t') := dim L+ (*;*') 
l;(t;t") := dim L;(t;t") 
e r (t; t") := dim(L+(t; f) n L r T(t; *")) 
z r (M') := dim(/ r (t,0) 
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Observation 4.7. For a tame map we can derive relevant level persistent numbers from the 
numbers N's of bar codes for level persistence as follows 

Proof. For t" < t < if 

l r (t) = number of intervals in BL r (f) which contain t; 
i r (t,t') = number of intervals in BL r (f) which contain [£,£']; 

Z+(t;0= Y N r%h)+ Y ^rfe.ti); (4-2) 
ti<t<tj<t' ti<t<tj<t' 

KW)= Y N r(ti^}+ E N rfatiY> ( 4 - 3 ) 
t"<ti<t<tj t"<ti<t<tj 

e r (t;t',t") = Yl N r(ti,tj). (4.4) 

t"<ti<t<tj<t' 

□ 

Theorem 4.8. For a tame map we can get l r (t), l+(t;t'), l~(t;t") and e r (t;t',t") from i r (t,t') . 
Proof. 

l r (t) = i r (t, t). 

N r (t k , tj) = i r (t", t') - i r (t k , if) - i r (t", tj) + i r (t k , tj) 
for any t" < tf such that t k < t" < tk+i, tj-i <t' < tj. 

N r [t k , tj] = i r (t k , tj) - i r (t", tj) - i r (t k , t') + i r (t", if) 
for any tk-i < t" < t k and tj <t'< tj+i- 

N r (t k , tj] = i r (t", tj) - i r (t k , tj) - i r (t", t') + i r (t k , t') 
for any t k < t" < t k+ i and tj <t'< tj+i- 

N r [t k , tj) = i r (t k , t') - i r (t k , tj) - i r (t", t') + i r (t", tj) 

for any t k _i < t" < t k and £ 3 -_j <t'< tj. 

Plug in equation g^D, g^D and (g3D we get /+(£;£')> KW) and e r (t;t',t"). 

□ 

Theorem 4.9. For a tame map we can get bar codes for level persistence from relevant persistent 
numbers. 

Proof. 

N r (t k ,tj) = e r (t;tj,t k ) — e r (t;tj,t k+ i) — e r (t; t k ) + e r (t; t k+ i), (4.5) 

for any t k < t < tj. 

Notice that for different fs, N r (t k ,tj) remain the same. 
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To determine the numbers N r (ti, tj], N r [t{, tj) and N r [ti, tj], we introduce the following auxiliary 
numbers which can be determined by relevant persistent numbers: 

N r {ti,tj} := the number of intervals in BL r (f) which intersect the levels X ti and X tj ; 
N r {ti,tj) := the number of intervals in BL r (f) which intersect the level X t . 
with open end at tj] 

N r {ti,tj] := the number of intervals in BL r (f) which intersect the level X t . 
with closed end at tj] 

N r (ti,tj} := the number of intervals in BL r (f) which intersect the level Xt- 
with open end at U; 

N r [ti,tj} := the number of intervals in BL r (f) which intersect the level X t - 
with closed end at tj. 

N r {ti, tj) = l^(tf, tj) — l^(ti] tj-x) 
N r (ti, tj} = l r [tj] ti) — l r [tj] ti + i) 
N r (ti, tj] = N r (ti, tj} - N r (ti, t j+1 } - N r (t t , t j+1 ) (4.6) 

N r [U, tj) = N r {U, tj) - NriU^tj) - NriU-!, tj) (4.7) 

■N r \ti : tjj i r (ti,tj) 
N r [t h tj} = N r {t h tj} - NriU^tj} - NriU-^tj} 

N r [U, tj] = N r [U, tj} - N r [ti, t j+1 } - N r [U, t j+1 ) (4.8) 

□ 

In order to calculate the relevant persistence numbers l r (t), l*(t; t'), l^(t] t") and e r (t; t', t") we 
induce so called positive and negative bar codes in subsection 14.41 Positive and negative bar 
codes are defined by sublevel persistence, so they can be calculated with the methods as described 
in subsection 13.41 

We can also calculate i r (t,t') with similar method. Details will be given in subsection 14.31 
Once we get all relevant numbers, we can easily get the bar codes for the level persistence. 

4.3 The General Framework for the Calculation of Bar Codes for Level 
Persistence of Generic M-valued Linear Maps on Simplicial Com- 
plexes 

Definition 4.4 (Generic K- valued Linear Maps on Simplicial Complexes). Let X be a simplicial 
complex and |X| its spatial realization. An M.-valued linear map on X is a continuous map 
/ : |X| — y R whose restriction to each simplex is linear. Let X be the set of vertices of X. f is 
generic if / : X — > R is injective. (See page 2, jl]) 
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Theorem 4.10. Generic W-valued linear maps on simplicial complexes are tame. 



Proof. Let / be a generic R- valued linear map on a simplicial complex X with N +1 vertices. Let 
t < • • • < t N be the the values of / on the vertices of X. Label vertices of X such that f(xi) = ti, 
< i < N. 

(i) For s < t with £j_j < s,t < ti one produces a canonical homeomorphism 9 s t '■ X s ^ — >■ 
X t x [s,t] which intertwines the map / with the second factor projection of X t x [s,t] on [s,t]. 

Given a with a fl X s . t ^ <p- Each point x G D cr lies on an unique straight line segment 
\{x) which is the trajectory of V/| CT w.r.t the flat metric on a. Denote by L(/)(resp. R(l)) the 
intersection of this line with X s (resp. X t ) and by p s ,t( x ) '■= L(l(x))(resp. q s ,t(%) '■= R(K X ))) the 
obviously continuous map X s>t fl cr — > X s fl cr(resp. X s>t fl cr — > X t fl cr). Consider p S)t : X s>t fl a — )■ 
X s Hax [s,t](resp. q S)t : X Sit ncr — > X t ncrx [s,t]) the map defined by p s ,t( x ) — (Ps,t( x ), f( x ))( Tes P- 
q s ,t(%) = (Qs,t(x), f(x))). Clearly p Sjt and q S}t are homeomorphisms which intertwines the map / 
with the second factor projection of X t x [s,t] on [s,t\. 

For any o\ and o"2, p s ,t and g Sit agree on the intersection a% flifj. So we can define ^ s ,t|o-, 
the restriction of ^ S)( on each a, as g S) t. In this way, one produces a canonical homeomorphism 
9 S j : — > X t x [s, t] which intertwines the map / with the second factor projection of X t x [s, t] 
on [s, t]. 

Therefore, for any t ^ to, ti, ■ ■ ■ , tN, there exists e > so that / : — e, t+e) — >■ (£ — e, £ + e) 
and the second factor projection X t x (£ — e, t+e) — > (t—e, t+e) are fiber-wise homotopy equivalent. 

(ii) For s < t with s = £j_i, t < tj and a with cr fl -X"^ t 7^ 0, it is easy to see that X t ._ ljt fl a is 
the mapping cylinder of a canonical map X t fl cr — > X ti _ 1 fl cr which prove that J X't i _ 1 1 H ex retracts 
by deformation on X ti _ 1 fl a . For any ai and o"2, the above deformation retractions agree on the 
intersection o\ fl a 2 . So X 4i _ 1)t retracts by deformation on 

For s < t with tj_i < s,t = t», we can prove retracts by deformation on X 4i in the same 
way. 

□ 

Let / be a generic M-valued linear map on a simplicial complex X with N + 1 vertices. Let 
t < • • • < t;v be the the values of / on the vertices of X. Since / is tame, we can define bar codes 
for level persistence of / : \X\ — > M. By Lemma [4.5^ the bar codes for level persistence of / are 
intervals of the form [ti,tj] with tj < tj and (tj,tj), [ti,tj], [ti,tj) with tj < tj. Note 

Lemma 4.11. For the map f : \X\ — > R defined above, the bar codes of the form (tj,tj + i) do not 
exist. 

Let / and X be defined as above. Let Mx be the boundary matrix of X which records the 
boundary map d : X — > X. Start with M x and the values on vertices t < ti < • • • < t^, we can 
calculate the bar codes for level persistence with two methods: Method 1 , which involves only 
i r (t, £'), and Method 2, which involves t'), Z~(t; t"), e r (t; t", t') and i r (t, £'). Finally one use the 
pervious results to determine the bar codes. It appears that Method 1 is simpler, however we will 
see later that Method 2 is about 8 times faster than method 1. 

5 Any simplicial map is tame, the arguments although similar are more elaborate to write down in the general 
case. 
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Definition 4.5. Given any t, we get a polytopal complex 

X t :={\a\n\X\ t | aeX}. 

Given any s < t, we get aother polytopal complex 

X SJt :=X s UX t U{ H n \X\, tt I a G X }. 

For s = i, we define X Sjt as X t . 

Method 1 (coefficients in Z 2 ) 

Given s < t, order cells in X Sjt in the following way: 

(i) if r G X s , r' G X t and r" G X Sjt , then r -< r' -< r"\ 

(ii) if dim(r) < dim(r'), then r -< t' . 

The ordering of the vertices provides orientation on simplices and then on the cells obtained 
as "cuts". We can write a matrix M Xs , t for the boundary map d 8>t : X 8>t — > X 8>t with respect to 
the order of cells above. 

From the reduced form R(Mx s t ) calculated by Algorithm 2.1.., we can read the dim(iJ r (|X| s ) — > 
H r (\X\ Sjt )) and dim(H r (\X\ s U \X\ t ) -)■ H r (\X\ 8 , t )). 

Let ni = #(X S ), n 2 = #{Xt) an d n — #(^s,t)- For a zero column j, 1 < j < n±, if there 
does not exist column k > j such that low(k) = j then column j contributes a generator in 
H r (\X\ s ) — > H r (\X\ Sjt ). Therefore dim(H r (\X\ s ) — > H r (\X\ Sjt )) = number of such columns j. 

For a zero column j', 1 < f < n\ + n 2 , if there does not exist column /c > f such that 
low(k) = j' then column j' contributes a generator in i? r (|X| s U \X\ t ) — > H r (\X\ Syt )- Therefore 
dim(H r (\X\ 8 U \X\ t ) — > H r (\X\ S;t )) = number of such columns j'. 

Let's change the order of cells in X 8jt . 

Given s < t, order cells in X s>t in the following way: 

(i) if r G X t , t' G X s and t" G X s>t , then r -< t' -< t"; 

(ii) if dim(r) < dim(r'), then r -< r' . 

We can write a matrix t for the boundary map d S)t '■ X 8jt — > X Sit with respect to the order 
of cells above. 

From the reduced form R(M' Xs ) calculated by Algorithm 2.1.., we can read the dim(H r (\X\ t ) — >■ 
if r (|X| s>t )) and dim(if r (|X| s U — >■ iJ r (|JT| S)t ) as describe above. 
Since 

I r (s, t) = img{H r {\X\ s ) -+ H r (\X\ 8it )) n img(H r (\X\ t ) -+ H r (\X\ 8>t )) 

we have 

i r (s,t) = dim(/ r (s,t)) 

= dim(img(# r (|X| s ) if r (|X| ilt )) H img(H r (\X\ t ) -+ H r (\X\ 8>t ))) 
= dim(img(if r (|X| s ) -> if r (|X| ilt ))) + dim(img(# r (|X| t ) -> if r (|X| a>t ))) 
- dim(img(ff r (|X| a ) H r (\X\ 8tt )) + img(tf r (|X| t ) if r (|X| a , t ))). 

Since 

img(H r (\X\ 8 ) -+ H r (\X\ 8>t )) + img(H r (\X\ t ) -+ H r (\X\ 8>t )) 
= img((H r (\X\ s ) © H r (\X\ t )) -+ H r (\X\ Stt )) 
= img(H r (\X\ s U\X\ t )^H r (\X\ 8tt )) 
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we have 



i r {s,t) = dim(img(# r (|X| s ) -+H r (\X\ Sjt )))+dim(img(H r {\X\ t ) -+ H r (\X\ a , t ))) 
- dim(img(# r (|X| s U \X\ t ) -> H r (\X\ Sit ))) 

for s < t. 

For s = t, we have 

i r (s, s) = dim(H r (\X\ s )) 

which can be read from the reduced form R(Mx s ) directly. 

Once we get i r {s, t) for any s < t, we can get the numbers of all bar codes of level persistence 
according to Theorem 14.81 

Observe that in view of the tameness of / we only need finitely many i r (s, t). Let t_i = t — 1, 
£jv+i — + 1 and £4+1/2 = {U + £i+i)/2 for — 1 < i < N. It is enough to know i r (s,t) for 
s,te {£ i/2 | - 1 < % < 2N + 1}. 

Therefore we need to calculate reduced forms of about 4N 2 matrices in order to get r-bar codes 
of level persistence. 

Method 2 (with coefficients in Z 2 ) 

Method 2 is based on the formulas of Theorem 14.91 

So called positive and negative bar codes are introduced in the next subsection in order to 
calculate l r (ti), tffatj), and e r (ti;tk,tj) for tu < t% < tj efficiently We need to calculate 

reduced forms of about 2N matrices in order to get Z r (tj), £j), l~(tf,tj) and e r (tf,ti-,tj) for 

tk — — tj • 

We still need to calculate v(£«; tj) for tj < tj. We have 

i r (ti,tj) = dim(img(# r (|Xy H r (\X\ titt .))) + dim(img(H r (\X\ t .) H r (\X\ ti>t .))) 
-dim(img(F r (|X| t , U H r (LY| tj , t .))). 

for tj < £j. 

dim(img(if r (|X|i. U \X\ t .) — > H r {\X\ t . t .))) can be calculated in the same way as described in 
Method 1. 

dim(img(if r (|Xy -)■ H r (\X\ Uttj ))) and dim(img(if r (|X|i j ) -)• H r (\X\ titt .))) can be calculated 
at very little cost with the knowledge of l r {ti), l£(tf,tj) and l~(ti;tj). 

dim(img(H r (\X\ k ) ^ H r (\X\ k , tj ))) 
= dim(F r (|Jfy) - dim(ker(tf r (|X| tj ) -> F r (|X| ti , t .))) 
/ r (£j) / r (tj, £j) 

dim(img(i? r (|X| t .)^F r (|Z| tiit .))) 
= dim(tf r (|Xy) - dim(ker(# r (|X| t .) tf r (|X | tj>t .))) 
l r (tj) Z r (tj, tj) 

We need to calculate reduced forms of about N 2 /2 matrices in order to get i r (tj; tj) for tj < tj. 
Therefore we need to calculate reduced forms of about N 2 /2 + 2N matrices in order to get 
r-bar codes of level persistence. 
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4.4 Definition of Positive and Negative Bar Codes of a Tame Map 

Suppose / : X — > R is a tame map with t\ < t<i < ■ ■ ■ < ijv the critical values. 

Proposition 4.12. Define t m+1 / 2 = (t m + t m+ i)/2, 1 < m < N — 1. Given t\<u<s<t< t^ , 
there exist k,i,j such that t^_i <u< t^fa < s < t i+ i,tj <t< tj+i. We have 

H r (X u ), s = U 

Hr(X t . +1/2 ), U < S < t i+ i 



H r (X s 



ker(H r (X s ) H r (X t)t )) 

ker{H r {X ti ) H r {X taj )), s = U 

ker(# r (X tf+1/2 ) ->• H r (X ti+1/2 ,tj)), tt < s < t i+1 < t 

0, ti < s < t < t i+ i 

ker(H r (X s ) H r (X u . s )) 

kex{H r {X ti ) H r {X tk , ti )), s = U 

ker(H r (X t . +1/2 ) H r {X thit . +1/2 )), u<U<s< t i+l 

0, ti < u < s < t i+ i 

ker(H r (X s ) -> H r (X s , t )) n ker(# r (X s ) H r (X u , s )) 

kex(H r (X ti ) H r (X ti , tj )) n ker(# r (X ti ) H r (X tk , ti )), s = U 

kev(H r (X ti+1/2 ) tf r (X ti+1/2 ,t,)) n ker(tf r (X ti+1/2 ) iJ r (X tfc , ti+1/2 )), 

U < ti < S < ti+i < t 

0, otherwise 

Proof. We will verify the case u < ti < s < t i+ i < t, other cases can be proved similarly. 
The commutative diagram (all maps are inclusions) 



induces 



X„ 



H r (X tkj s) •* H r {X t 



X s i 




H r (X s>tj ) 



H r (X U)S ) H r (X ait ) 

We have two isomorphisms in the diagram above, since X UjS and X^ retracts by deformation 
onto X fcjS and X S) t. by Lemma [4.21 So we have 

ker(# r (X s ) -> H r (X S!t )) = kei(H r (X s ) -> H r (X S;t .)) 
ker(H r (X s ) # r (X u , s )) = ker(# r (X s ) H r {X tktS )) 
and (*) 
ker(# r (X s ) -»■ H r {X s +)) nkei(H r (X s ) H r (X u>s )) 
= ker(H r (X s ) -> H r (xL)) H ker(# r (X s ) -> H r (x' tktS )) 
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When s = t i} we get corespondent isomorphisms in this proposition. 

When ti < s < t i+ i, wlog, suppose ^+1/2 < s < t i+1 . By Lemma H31 Xi. +1/2jS retracts by 
deformation onto -X*. 2 and X s , the commutative diagram below (all maps are inclusions) 



X. 



X 




*i+l/2,s 



' ^*i+l 



+1/2,1.7 



induces 



*fci*i+l/2 *i+l/2 



H r (X s 



H r (X tk 




H r (X t . +1 



+l/2,fl' 



H r (X S;t .) 



H r(X u+1/2ttj/ 



H r {X tkM+1/2 ) H r (X ti+1/2 ) 

From the diagram above, we have 
H r (X 8 ) = H r (X t . +1/2 ) 

ker(H r (X s ) # r (X S)t .)) S ker(tf r (X 4s+1/2 ) ff r (X ti+1/2>t .)) 
ker(tf r (X s ) # r (X tfcjS )) ker(tf r (X ti+1/2 ) tf r (X tfcjti+1/ J) 
ker(# r (X s ) -> H r (X s , tj )) nker(H r (X a ) -> H r (X tk>s )) 
= ker(H r (X ti+1/2 ) flr(X^ 1/aA )) n ker(tf r (X ti+1/2 ) -> tf r (X, fc , w )) 

Combined with (*), we prove Proposition 14.121 when u <ti< s < t i+ i < t. 



□ 



From Proposition 14.121 we can see that relevant level persistent numbers Z r (s), Z^(s; £), Z r (s; w) 
and e r (s;t, w) for ti < -u < s < t < t N are determined by / r (*(i+i)/2), #"(*(»+i)/2;*j), ^(%+i)/2j*fc) 
and e r (t ( i + i )/2 ; i,-, **) for 1 < z < 2iV - 1, 1 < k < (i + l)/2 <j<N. 



Proposition 4.13. Lei 



/a 



A- 



/» T/ /i+l 

— *" 5 



A' 



6e a diagram of vector spaces V\, • • • , Vn over a field k. 
Denote 

{fk°---° fi-2 ° fi-i k < % 
id k = i 

fk° ■ ■ - fi+i° fi k > i 
W k = img(g k : V t -> V k ) 
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We can always choose a basis {v±, ■ ■ ■ ,v n } of Vi such that condition (Pi) below is satisfied for 
1 < k < N. 

Condition(Pi) : Nonzero elements of {<7jfc(i>i), • • • ,g k (v n )} is a basis ofW k . 

Proof, (a) Let {x\, ■ ■ ■ ,x n } be a basis of Vi, then (Pi) is true for k = i. Suppose (Pi) is true for 
i < k < j, we can modify the basis so that (Pi) is also true for i < k < j + 1. 

If gj + \(xi) = for 1 < i < n, then (Pi) is satisfied for i < k < j + 1; otherwise, there exist 
1 < ni < ri2 < • • ■ < n r < n such that gj + i(x ni ), ■ ■ ■ ,gj + i(x nr ) are nonzero and gj + i(xi) = for 
/ {ni, • • • ,n r }. 

If r = dim(W^ + i), then {gj + i(x ni ), ■ ■ ■ , fi'j+i(^rt r )} forms a basis of Wj + i, otherwise, there 
exist 2 < r\ < r such that gj+i(x ni ), ■ ■ ■ , gj+i^x^^J are linearly independent, but gj+i(x nri ) = 
n— l 

^ aig j+1 (x ni ) for some a x G k. 
i=i 

n-l 

Let x' nr ^ = x Hri — aix nr Replace x nr by x' n ^, {x±, ■ ■ ■ , ,x n } is still a basis of 

i=i 

Vi. Since g j+1 (x ni ), ■ ■ ■ ,g j+1 (x nri ) are nonzero, gk{x ni ), • • • ,9k(x nri ) are nonzero for « < k < j. 
Since (Pi) is true for {x±, • • • ,x n } and i < k < j, gk(x ni ),--- ,gk(xn ri ) are linearly indepen- 
dent for % < k < j. Hence g k (x' nri ) ^ for % < k < j. Replace g k (x nri ) by g k (x' nri ), 
{g k (xi), • • ■ ,gk(x nri ), • • • ,9k(x n )} and {g k (xi), ■ ■ • , g k (x' nri ), ■ ■ ■ ,g k (x n )} have the same number 
of nonzero elements for i < k < j, since g k (x nri ) and gk(x' nr ^) are nonzero for i < k < j. 
In addition, those nonzero elements can be linearly represented by each other. So nonzero el- 
ements of {g k (xi),--- , <7fc(a4 ), • • • ,g k (x n )} is still a basis of W k for i < k < j. Hence (Pi) 
is still true for {x±, • • ■ ,x' n ^,--- ,x n } and i < k < j, but the number of nonzero elements 
of {<7j + i(xi), • • • , gj + i{x' ), • • • , gj+i(x n )} is one less than the number of nonzero elements of 
{9j+i(xi), ■ ■ ■ ,g j+ i(x' nri ), ■ ■ ■ ,g j+1 (x n )} since g j+1 (x' nri ) = by definition and g j+1 (x nri ) ^ 0. 

By induction on the number of nonzero elements of {gj + i(xi), ■ ■ ■ , gj + i(x nri ), • • • , gj + i(x n )}, 
there exist a basis {x[, ■ ■ ■ ,x' n } of Vi such that (Pi) is true for i < k < j + 1. 

Then by induction on j, we get a basis {yi, • • • , y n } of Vi such that (Pi) is true for i < k < N. 

Similarly, we can get a basis {z±, • • • , z n } of Vi such that (Pi) is true for 1 < k < i. 

(b) Definition: Negative Life Given z% G Vi as in the basis above, if g k (zi) ^ and 
9k-i( z i) = for some 2 < k < i or g k (zi) ^ for k — 1, define the negative life of zi to be i — fc + 1. 

Order ^i, • • • ,z n according to their negative life increasingly. 

i 

Claim: For any given 1 < I < n — 1, if we replace Zi + \ by = Zi + \ — & m -2 m for any given 

m=l 

6 m G k, {^i, • • • , • • • , z n } still satisfies (Pi) for 1 < k < i. 
Proof of the claim: 

Since z±, ■ ■ ■ ,z n are ordered according to their negative life increasingly, the negative life of 
z m is less than the negative life of zi + \ for 1 < m < I. Suppose g kl (zi + i) ^ and ^1—1(^+1) — 
for some 2 < k\ < i, then we must have g k (z m ) = for all 1 < k < k\ — 1 and 1 < m < I + 1. 
Therefore g k (z l+l ) = g k (z' l+1 ) = for 1 < k < h - 1. Hence {fi^i), • • • ,g k (z' l+1 ), ■ ■ ■ , g k (z n )} is 
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the same as {gk(zi), ■ • • , gk(z n )} for 1 < fe < fei - 1. So {zi, • • • , z' l+1 , • • • , z n } satisfies (Pi) for 
1 < k < ki - 1. 

Given k\ < k' < i, we have 7^ 0. There exist 1 < Zi < I, such that gk>(z m ) = for 

1 < m < h and g k >{z m ) ^ for Z x + 1 < m < I + 1. Hence ^'(^+1) = 9k'{zi+i) - ^ b m g k >(z m ). 

m=h+l 

Therefore nonzero elements of {flfe'C^i), • • • , gk'(z n )} and {^'(24), • • • , #fc/(^' +1 ), > 9k' {z n )} can be 
linearly represented by each other. Since zi, ■ ■ ■ ,z n satisfies (Pi) for 1 < k < i, nonzero elements of 
{^/(^i), • • • , gk'(z n )} form abasis of Wk>- Then nonzero elements of {gk'(zi), •• • , gk>(z' l+i ), ■ ■ ■ ,gk>(z n 
also form a basis of Wk>- So {z±, • • • , z' l+1 , • • ■ , z n } satisfies (Pi) for k\ < k < i. 
The claim is proved. 

n-i 

Notice that in part (a), we replace x nri by x' nr = x Uri — 2_. a i x n r We only need this type 

of replacement to modify a basis {xi, ■ ■ • ,x n } into a basis {yi, ■ ■ ■ ,y n } that satisfies (Pi) for 

ri — 1 

i < k < N. And the replacement x Hri by x' Ur = x Uri — a>ix ni is only a special case of the 

1=1 

replacement in the above claim. Therefore, start from a basis {zi, ■ • • , z n } of Vi such that (Pi) is 
true for 1 < k < i which is ordered according to their negative life increasingly, we can use the 
same process in part (a) to modify this basis until (Pi) is satisfied for 1 < k < N. 

□ 

We use the concept "positive bar code" because it is consistent with the sublevel persistence 
associated to the space -^[t (i+1)/2 >°°) an d the map f\x [t . +i20o) - Similarly, we use the concept 
"negative bar code" because it is consistent with the sublevel persistence associated to the space 
X(-oo */•_■_■.•> /oi and the map fix, , 

Let d(i+i)/ 2 = dim(P r (X 4(i+1)/2 )). Choose a basis {ci, • • • ,c d(i+1)/2 } of H r (X t(i+1)/2 ) s.t. for 
any k < (i + l)/2 and j > (i + l)/2, nonzero elements of {erfc(ci), • • • , Cfc(cd {l+1)/2 )} is always a 
basis of img(cr fc : H r (X t( . +1)/2 ) ->■ # r (X tfejt{i+1)/2 )) and nonzero elements of {^(ci), • ■ ■ , r 3 -(c d(i+1)/2 )} 
is always a basis of img(rj : H r (X t .. +1)/2 ) —> H r (X t .. +1)/2!tj )), where a k and Tj are induced by 
inclusion. This is possible by Proposition 14.131 

Definition 4.6. Positive and Negative Bar Codes 

(i) The set of r-positive bar codes B+(f; t( i+1 y 2 ) of X t(i+1)/2t00 is a collection of d( i+ iy 2 intervals 
described as follows. For any generator c a G {ci, ■ • • , Cd (i+1)/2 }, if c a dies upward at tf(tji > t^ + iy 2 ), 
then we have an interval (t(i+i)/2, V) i n Brif'i *(*+i)/2)? otherwise, if c a never dies upward, we have 
an interval (t( i+1) / 2 , 00) in B+(f; % + i)/ 2 ). 

(ii) The set of r-negative bar codes B~(f;t^ + iy 2 ) of A_ oo t(i+1)/2 is a collection of d(i+i)/2 
intervals described as follows. For any generator c& G {ci, ■ ■ • , Q (i+1)/2 }, if c& dies downward at 
£fe'(^fc' < t(i+i)/2), then we have an interval (£&', %+i)/2) in B~(f;t( i+ iy 2 ); otherwise, if c& never 
dies downward, we have an interval (— 00, t^ + iy 2 ) in B~(f;t(i + i)/ 2 ). 

(iii) Each generator correspond to an interval (a&, £(j+i)/2) in the negative bar code B~(f, t(»+i)/2 
and an interval (t^ + iy 2 ,b k ) in the positive bar code B^(f,t^ + iy 2 ). Therefore each genera- 
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tor c k defines an interval (a k ,b k ). Define B r (f,t( i+ iy 2 ) as the collection of the above intervals 

{OfcAOl 1 <k< d {i+1)/2 }. 

We have introduced these additional numbers given the fact they can be efficiently calculated as 
indicated in subsection 14.51 and they are equivalent to the relevant persistent numbers £ r (t(i+i)/ 2 ), 
i+(t( i+1 )/ 2 ;tj), l~(t(i+i)/2]t k ) and e r (t(i +1 y 2 ',tj,t k ) as indicated below. 

We can get the relevant persistent numbers l r (ta + iy 2 ), ^(%+i)/2i tj), l~(tr i+ iy 2 ; t k ) and e r (tr i+ iy 2 ; tj. 
from the positive and negative bar codes: 

l r (t (l+1)/2 ) = dimtf r (X t(i+1)/2 ) = \B+(f,t {i+1)/2 )\ = \B;(f,t {i+1)/2 )\ (4.9) 

C(*(i+i)/2;*i) = Uh+i)/2) - #{(tm)/2,b) e B+u\t(i+m) I b > tj} (4.10) 

l~(t( i+ iy 2 ;t k ) = l r (t( i+1 y 2 ) - #{(a,i( i+ i)/ 2 ) G B~(f,t(i + iy 2 ) \ a < t k } (4.11) 
&r\t{i+l)/2'i tj, tk) 

= lr(t( i+ i)/ 2 ) - (Z r (t(i+i)/ 2 ) - l?(t( i+ iy 2 ] tj)) - (/ r (t(i + l)/ 2 ) - lr{t(i+l)/2\ tk)) 

+#{(a,6) G B r (f,t( i+1 y 2 ) \a<t k ,b> tj} 

= ^(^(i+l)/2; tj) + ^(t(i+l)/ 2 ; t k ) — / r (^(i+l)/2) + b) G -B r (/, *(»+l)/2) I 0, < tk, b > tj} 

(4.12) 

We can also get the positive and negative bar codes from the relevant persistent numbers 

lr(t( i+ i)/ 2 ), lr{t(i+\)/ 2 ,tj), l~(t(i + i)/ 2 ;t k ) and e r (t(i +1 y 2 ; tj, t k ): 

#{(t(i+i)/2,tj) g B r (/, i( i+1 )/ 2 )} = Z r (*( i+ i)/ 2 ;tf) - / r (*(i+i)/ 2 ; ij-i) 

*(i+l)/2> G S7(/,t( j+ i)/ 2 )} = lr(t{i+l)/2',t k ) —l~(t(i + i)/2',t k+ l) 

#{(t k , tj) G -B r (/, ^(1+13/2)} = e(t( i+ i)/ 2 ; tj, £*.) — e(i( i+ i)/ 2 ; tj, t fc+1 ) — e(t( i+1 )/ 2 ; tj_i, t fc ) 

+e(t( i+ i)/ 2 ; tj-i, tfc+i) 

Therefore, positive and negative bar codes and relevant persistent numbers l r (t(i + iy 2 ), l+{tu + iy 2 ; tj), 
l r(t(i+i)/2\t k ) and e r (t( i+1 )/ 2 ;tj,t k ) are equivalent. 

4.5 Computing Positive and Negative Bar codes of Generic M-valued 
Linear Maps on Simplicial Complexes 

Let X be a simplicial complex with N vertices. Let / : \X\ — > K be a generic linear map on X, 
with ti < ■ ■ ■ < t N being values on vertices x%, ■ ■ • , of X. According to Method 2 in subsection 
14.31 we need to calculate l r (ti), l^{U;tj), l~(ti,tj) and e r {ti,t k ,tj) iorl<k<i<j<N. So we 
only need to calculate B+(\X\;U) and B~(\X\;ti) for 1 < i < N, then plug in (Oil- ( 13121) . 

We'll describe how to compute positive bar codes in details and negative bar codes briefly at 
the end. 

In order to get B+(\X\; ti), we have to consider the filtration below 

\x\ t . c \x\ hM+1 c • • • c \x\ t . ttN 

(see Definition 4.1 and 4.2, [3]). 

It's hard to compute the positive bar codes from this filtration directly. Instead, we'll consider 
an equivalent filtration. 



57 



Definition 4.7. Given a j-simplex a = [x no ,--- ,x nj ] of X, 1 < no < • • • < fij < N, define 
t m in{\<7\) = t no , t max (\a\) = t nj . For t no < s < t nj , define |cr| s = |cr| n \X\ S , \a\ Sj00 = \a\ n \X\ Sj0O , 

W\-oo,s — \°~\ n |x|_oo iS . 

Extend t min and t max to |cr| s , |cr| S;00 and H_oo iS , we have 

^mm(|°is) tmax ( \&\s) 
tmin\\&\s,oo) ^max 



s,ooj v n;i 



Note: It is easy to see that all cells in X tuOQ are {\a\ \ t m i n (\a\) > {\cr\ ti \ t m i n (\a\) <U < 
t m ax{W\)} or {\cr\ ti:00 | t min (\a\) < t t < t max (\a\)}. 

Definition 4.8. Given t x < s < t < t N , define Y 8jt to be a cell complex consist of cells in X s oo 
that are contained in the space X_ 00tt . In another word, cells in Y Sjt are the cells c in X s oo which 
satisfy t max (c) < t. 



Lemma 4.14. Let a = [x , • • • ,x n ] be an n-simplex and f : \a\ — > R be a generic linear map 
on it. Suppose f(xo) < f( x i) < •■• < f(x n ), let tj = f(xj),0 < j < n. Given t < s < t n , 
suppose U < s < t i+ i for some < i < n — 1. Let |<7|_oo )S = oo, s], then |it|_oo,s retracts by 

deformation onto \ [xo, - ■ ■ ,Xi}\. 

n n i 

Proof. Given x G |er|-oo,s) write x as a j x j, where < a j < l,^^Oj = 1. We have ^2 a 3 > 0, 

3=0 3=0 3=0 

n n 

otherwise a 3 - = 0,0 < j < i, x = a j x ji f( x ) — a jtj > s, since tj > s for j > i + 1 and 

j=i+l j=i+l 

n 

dj = 1. This is a contradiction with x E |(t|_oo iS . 

j=i+l 

Define go : |er|-oo,s — > \&\-<x>,s to be the identity map. 
Define 

9l ■ \&\— oo,s ^ oo.s 



X — a 3 X 3 ^ ~~T ajXj 

3=0 

gi is a well-defined continuous map and <7i||[xo,-,xi]| is the identity. 
Define 

G : |cr|_oo,s X 7 — >■ |cr|_oo jS 

(x,r) (1 - r)go(x) + rg 1 (x) 

G is a deformation retraction from |<t|-oo,s onto |[xq, • • • , and it is canonical. 



□ 
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Figure 7: A cone with apex x n and base a convex cell with vertices Xo, ■ ■ ■ , x n -\ 

Lemma 4.15. Let a be a cone with apex x n and base a convex cell with vertices Xq, ■ ■ ■ , 
See Figure^ above. Let f be a linear map on \o~\ s.t. f(xo) < f(%i) < •• ■ < f{x n -i) < f{x n )- 
For f(x n -i) < s < f(x n ), |cr|_oo, s = oo, s] retracts by deformation onto the base, again by a 

canonical retraction. 

Proof. Since a is a cone, for any point x G |er|(x 7^ x n ), there exists a unique x' in the base s.t. 
x — (1 — a)x' + ax n , for some < a < 1. 

Define g : |c"|-oo,s — > |°"|-oo,s to be the identity map. 

Define 

9l ■ \&\— 00, s ^ oo,s 

x 1 — y x 1 

gi is a well-defined continuous map and gi\thebase is the identity. 
Define 

G : |o-|_oo >s x / — > |a|-oo,s 

(x,t) f-^ (1 - r)g (x) + rg^x) 

G is a deformation retraction from |er|_oo,.s onto the base. 

□ 

Lemma 4.16. Let X = [xq,xi, ■ ■ ■ ,x n ] be an n-simplex and f be a generic linear map on \X\. 
Suppose f(xo) < f(xi) < ■ ■ ■ < f(x n ), let tj = f(xj), < j < n. Given t < s < t < t n , let Y Stt be 
as defined in Definition ^. S\ then X s>t retracts by deformation onto Y Syt . See Figured above. 

Proof. Suppose U < s < t i+ i, tj <t < tj + x, 0<i<j<n — 1. When i = j, Y Sjt = X s and X Sjt 
retracts by deformation onto X s by Lemma l4T4l When i < j, X Stt retracts by deformation onto 
X s j.. Since Y s>t = Y s>t ., we only need to prove X Sit . \ Y S;t .. 

Note: "\" means "retracts by deformation onto" here and after. 

X SjOQ is an n-dim convex cell, which can be viewed as union of n-dim cones with apex x n and 
base (n — l)-dim faces of X s>00 which do not contain x n . 
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X 




Figure 8: X, X S)t and Y B>t 

By Lemma I4.15[ the intersection of each of the cones and X s>tn _ 1 retracts by deformation to 
the base of that cone. 

Since the deformation retractions are compatible on the intersection of the cones, X s j n _ 1 = 
union of intersections of the cones and X 8 , t n _i \ union of bases of the cones = (n — l)-dim faces 
of X s>00 which do not contain x n = Yg^^. 

Since 3 / ^ itTi _ 1 are union of (n — l)-dim faces of X SiOC which do not contain x n , i^ )tri l = Y Sitn _ 2 \J 
cells of i^ jtTi _ 1 which contain x n -\. 

Let o"i, o"2 be (n — l)-dim cells of Y s tn _^ which contain <j\ fl a 2 ^ <f). <J\ can be viewed as 

union of (n — l)-dim cones with apex 

By Lemma 14.151 a± fl X Sjtn _ 2 \ a sub cell complex of Y S)tn _ 2 . Similarly, cr 2 H X Sttn _ 2 \ a sub 
cell complex of Y S)tn _ 2 ■ 

The deformation retractions on a\ fl X Sj t n _ 2 and cr 2 D X S) t w _ 2 induce the same deformation 
retractions on a x H cr 2 H X Sjtn _ 2 . Therefore K S)tn _ 1 fl X Sitri _ 2 \ F s><n _ 2 . 

^ s ,t„-i \ ^A-i implies X sA _ 2 \ F Sjtn _ x nX Sjf „_ 2 . So X sU _ 2 \ F Sjt „_ 2 . Suppose X s>tk \ 
we have X Sitfc _ 1 \ K Sif)fe n 

Let a be a cell of Y s>tk which contains Xk- o can be viewed as union of cones with apex Xk and 
base convex cells in 3 / s ,t i ,_ 1 . 
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By Lemma 14. 15[ a fl X s ^ k _ 1 retracts by deformation onto a sub cell complex of Y s<tk _ 1 . 
The above deformation retractions are coherent on the boundary of the cones, so they induce 
a deformation retraction from Y Sttk fl X Sytk _ 1 onto Y Sitk _ 1 . 
By induction X s j. \ Y Sit .. 

□ 

Proposition 4.17. X Sit retracts by deformation onto Y S)t - 

Proof. Four types of simplices of X have nonempty intersection with space |X| Si £: 

(1) {a G X I t min (a) < s,s < t max (a) < t} 

(2) {a G X | t min (a) < s,t max (a) > t} 

(3) {a G X | t min {a) > s,t max (a) < t} 

(4) {a G X | s < t min {a) < t,t max (a) > t} 
Therefore, there are four types of cells of X Sit : 

(1) {os,oc I o G X,t min (a) < s,s < t max (a) < t} 

(2) {a S)t | cr G X,t min (a) < s,t max (cr) > t} 

(3) {cr g X | t min (a) > s,t max (a) < t} 

(4) {cr-oo^ | a G X, s < t min {a) < t,t max (cr) > t} 

There are five types of cells of X s>00 with respect to t (s < t): 

(1) {os,oo I o- G X, t min (a) < s,s < t max (a) < t} 

(2) {cr s ,oo I o- G X, t min (a) < s,t max (a) > t} 

(3) {a G X | t min (o") > s,t max (a) < t} 

(4) {a G X | s < t min (a) < t,t max (a) > t} 

(5) {a G X | t min (a) > t} 

There are four types of cell complexes of Y Sjt : 

(1) {os,oc | o G X,t min (a) < s, s < t max (a) < t} 

(2) {subcomplexes of 

c"s,oo which do not contain x m _|_i, • • • , x n | o~ — [•£()> * * * > -^n] ^ X, t m i n (^o' ) <c 
s, t max (a) > t, f([x , ■ ■ ■ , Xi]) < t, < i < m, f([x , • • • , Xi}) > t, m + 1 < i < n} 

(3) {cr G X | t min (a) > s,t max (a) < t} 

(4) {[x ,--- ,x m } | cr = [x ,--- ,x n ] G X,s < t min {a) < t,t max (a) > t,f([x ,--- ,^]) < t, < 
i < m, f([x , • • • , Xj]) >t,m + l<i<n} 

Notice that type (1) and (3) cells of X s j and Y s j are the same. By Lemma [4. 151 type (2) cells 
of X S)t retract by deformation onto type (2) cell complexes of Y Sjt . By Lemma [4. 141 type (4) cells 
of X s j retract by deformation onto type (4) simplices of Y Sj p 

□ 

By Proposition I4.17[ the filtration 

Y ti Q Y t . jU+1 C • • • C Y tutN 
provides the same bar codes as the filtration 

X ti Q X t . it . +1 C • • • C X tijtN . 
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The advantage to consider Y t . >t . instead of X tijt . is that Y tut . keeps more simplices from X 
and generates less "new" cells, which makes representation of cells and construction of boundary 
matrices easier. 

Note: From now on, we'll only consider homology groups with Z2 coefficients. H r (X; Z2) will 
be simply written as H r (X). 

Let X be a simplicial complex containing iV vertices Xi,x%, ■ ■■ ,xn- Let n = dimX, rrij = 
number of j-simplices of X, < j < n. Let / be a generic linear map on X. For simplicity, let 
f(xi) = i, 1 < i < N. Represent j-simplex of X by ( ), 1 < oo < a i < • • ■ < cbj < N. 

All information on the above simplicial complex X can be stored in n + 1 matrices named 
Simplex , Simplex^ • • • ,Simplex n . The matrix Simplex^- is an mj-by-(j + 1) matrix, which stores 
j-simplices of X as rows in lexicographic order. 



Example 4.1. 



Simplex 



/ 1 \ 

2 

3 
4 
5 

V e y 




Figure 9: Figure for Example 14.11 



/ 12 \ 

13 
23 
24 

Simplexj^ = 25 
35 
36 
45 
V 56 / 



Simplex 2 = ( 2 3 5 ) 



Definition 4.9. Initial Order 

Order all simplices of X first according to dimension increasingly, then according to lexico- 
graphic order, we call this order as initial order of simplices of X. 



Example 4.2. The initial order for Example EE] is 1<2<3<4<5<6<12<13<23< 
24 < 25 < 35 < 36 < 45 < 56 < 235. 
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Let 0i, o"2, • • • , cr m be simplices of X in initial order, where m = ^J m i- We have a boundary 

i=o 

matrix of Z2 coefficients 



<9 



0"1 
0--2 



o x o 2 ■■■ &j ■■■ a n , 
( \ 



0; 



di 



1 if Oi is a codimension-1 face of <jj 
otherwise 



We'll compute positive and negative bar codes from this boundary matrix d. 

4.6 Computing of Positive Bar codes B+(f;i) 

We'll describe how to compute positive bar codes B+(f; i) of X i)00 first. 
There are five classes of cells in X ioo : 

Pi = {0 

Pi = Wi I a e X,t min (cr) < i and t max (a) > i} 
P3 = Wi,oo I cr G X,t min (a) < i and t max (a) > i} 
Pi = {cr G X I dim(cr) > and t min (a) = i} 
P 5 = {a E X \ t min (cr) > i} 

Order cells in X iyOQ first according to these five classes, then according to initial order within 
each class. We can easily construct a boundary matrix df from d according to this order: 



<9+ := 





Pi 


Pi 


P3 


Pa 


P 5 


Pi 


( 


Qi 




Q 3 




Pi 




Mi 


Qi 






Pz 












Pa 












P5 


\ 






M 3 


M 4 



where Mj(l < % < 4) are correspondent submatrices of d, elements in Qi = 1 iff dim a = 2 and 
correspondent column in Mi contains only one "1" , Q 2 is identity matrix, elements in Q 3 = 1 iff 
dim cr = 1. 

Example 4.3. 



Let X be the same as in Example 14.11 and % = 4, we have 
Pi = {4} 
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Figure 10: Figure for Example 14.31 



P-2 
P3 
Pi 
P, 



{25| 4 ,35 

{25| 4 , 
{45} 

{5,6,56} 



4,36 
35 



1} 



.4, 235 1 

4,ooj 36 1 4 i00 , 235|4 )OC } 



Pi 
4 

/ 



Pi 



25 1 4 35 1 4 36L 235 4 25 4 



Pi 
45 




Pa 45 





V 

It's convenient to construct d£ using the above order. However, since this order is neither 
topologically consistent nor filtration compatible with Yi C Y^+i C • • • C (see Definition 
4. 2, [3]), we have to reorder those cells in X iyOQ . 
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Definition 4.10. Positive Order 

Order cells in X i oo first according to t max increasingly, then according to the initial order, we 
call this order positive order. 

For example, T\ and r 2 are two cells in X itOQ . If t max (ri) < t max (T2), then n < r 2 in positive 
order. If t max (Ti) = t max (T2), according to the note after Definition 14. 7\ Tj(j = 1,2) can be 
uniquely represented by &j,o-j\i or Cj\i tOQ for some a,- G X. If o\ < cr 2 in initial order, we require 
T\ < r 2 in positive order. 

Notice that Pi and P 2 do not change in positive order. The cells in P3, P4 and P5 permute. 
Denote the cells in P 3 , P4 and P5 in positive order by P3,4,5- Denote the boundary matrix of X itOQ 
by df when the cells are in positive order. 

Example 4.4. Consider Example I4.3[ reorder all cells in X 4 oo in positive order, we have 



Pi 



P 



3,4,5' 



P 



P 



2 



P 



3,4,5 



4 25| 4 35| 4 36| 4 235| 4 5 25| 4)0 o 35| 4 ,oo 45 235| 4 ,oo 6 36| 4 ,oo 56 
/ \ 



1 



1 1 



V 



From d~l , we will get its reduced form R{df) and read the bar code directly from it. 

Given an m-by-m matrix d with Z 2 coefficients. Let low(j) be the row index of lowest 1 in 
column j. If the entire column is zero, then low(j) is undefined. We call d reduced if for any two 
non-zero columns j and jo, we have low(j) 7^ low(jo). See page 153, [20]. The following algorithm 
reduces d by adding columns from left to right. 



Algorithm 2.1. 
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for j = 1 to m — 1 do 




if column j is nonzero do 




while there exists j > j with low (jo) 


= low(j) do 


add column j to column j 




endwhile 




let {ii, ■ ■ ■ , ik} = {i 1 < i < j — 1, low(i) < low(j)} 


such that low(ii) > low(i 2 ) > 


• ■ ■ > low(ik) 


for 1=1 to k do 




while there exists jo > j with low 


(jo) = low (I) do 


add column I to column j 




endwhile 




endfor 




endif 




endfor 





Example 4.5. The reduced form R(df) of in Example 14.41 is: 



Pi 



P, 



3,4,5 



i?(«9 4 ) := 



Pi 



4 25| 4 35| 4 36| 4 235| 4 5 25| 4j00 35| 4i0O 45 235| 4i0O 6 36| 4i0O 56 
/ \ 



3,4,5 



4 




f 25 


4 


1 35 


4 


1 36 


1 


[235| 4 


'5 




25 


4,oo 


35 


4,oo 


45 




235 1 4 ; oo 


6 




36 


4,oo 


56 

V 





We can read bar codes directly from the reduced form R(df). For the first |Pi| + \P%\ columns 
of R(df), if a column j is zero, check if any column jo o n the right satisfies low(jo) = j- If 
there doesn't exist such a column j , then we have an interval [i, oo) in the bar code B+(f;i), 
where r = dim(crj), o~j is the cell correspond to column j. If there exists such a column jo and 
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jo > I -Pi I + IP2I) then we have an interval [i,f) in the bar code B+(f;i) where r = dim(<7.,), 
f — tmax(crj ), Gj and <Tj are the cells correspond to column j and j respectively. 

Example 4.6. We can read bar code from R(df) in Example 14.51 

P + (/;4) = {[4,oo), [4,5), [4,6)} 

4.7 Computing of Negative Bar codes B~(f;i) 

About computing of negative bar codes B~(f; i), one proceed in a similar manner. Precisely, first 
replace the filtration Xi C C • • • C Xij by Xi C Z^i t i C • • • C where is the set of 

all cell complexes in X_ 00 ^ i that is contained in X kjOQ . Second, construct the boundary matrix dj . 
There are five classes of cells in X_ 00yi : 

N 1 = P 1 = N 2 = P 2 , N 3 = P 3 

iV 4 = {a G X I dim(er) > and t max (a) = i} 

N 5 = {a e X j t max (a) < 1} 

If we order all cells in X_ 00 j first according to these five classes then according to the initial 
order, we can easily construct d~ from the submatrices of d. Third, order cells in X-ao^ first 
according to t m i n decreasingly then according to the initial order (so called negative order), the 
boundary matrix of X_ OQ>i becomes d~ . Fourth, apply Algorithm 2.1 to and get reduced form 
R{d^). Finally, read the negative bar code B~(f;i) from the reduced form R(d^). 

Notice: The first |Pi| + IP2I columns of df and d~ are essentially the same, since P± U P2 and 
Ni U N 2 contain the same cells in the same order. It's the same situation for df and d~ , since 
reordering cells in positive and negative order will not affect the first |P X | + \P 2 \ cells. The same 
thing happened for the reduced form R{df) and R(d~), since we apply the same algorithm to 
them and we only add columns from left to right in that algorithm. We have the same amount 
of zero columns in R(df) and R(d~), which are in the same position and correspond to the same 
cycles of X^. Therefore, the zero columns in R(df) and R{d^) which represent generators of 
H*(Xi) are also in 1— to— 1 correspondence. We can pair up intervals in positive and negative bar 
codes according to generators. 

4.8 Numerical Experiments 

All the bar code in this section are generated by the Matlab code given in the Appendix. All 
intervals in positive and negative bar codes paired up in the given order. 

Example 4.7. Bar codes of f 1 in Figure [TT1 
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Figure 11: / : a linear map on a 2-simplicial complex 





1) = 


{[l,oo)} 


Bo(f 


1) = 


{(- 


-oo,l] } 




2) = 


{ [2, oo) } 


Boif 1 


2) = 


{(- 


-oo,2] } 




3) = 


{ [3,00), [3,4) } 


Boif 1 


3) = 


{(- 


■oo,3], (2,3] } 




4) = 


{ [4, oo) } 


Boif 1 


4) = 


{(- 


-oo,4] } 




5) = 


{ [5,oo), [5,6) } 


Boif 1 


5) = 


{(- 


•oo,5], (4,5] } 




6) = 


{[6,oo)} 


BoU 1 


6) = 


{(- 


-oo,6] } 




7) = 


{ [7, oo) } 


BoiP 


7) = 


{(- 


-oo,7] } 




8) = 


{ [8, oo) } 


BoU 1 


8) = 


{(- 


■oo,8] } 


B^f 1 


9) = 


{[9,oo), [9,10)} 


BoV 1 


9) = 


{(- 


■oo,9], 8,9] } 


BtU 1 


10) = 


= {[10,oo)} 


Boif 1 


10) = 


= {( 


-oo,10] } 



3 




2 



Figure 12: f 2 : a linear map on the surface of a tetrahedron 

Example 4.8. Bar codes of f 2 in Figure [121 

B+(f 2 ; 1) = { [1, oo) } B'(f; 1) = { (-oo, 1] } 

5+(Z 2 ;2) = { [2,oo) } 5 -(/ 2 ;2) = { (-oo,2] } 

5+(Z 2 ;3) = { [3,oo) } 5 -(Z 2 ;3) = { (-oo,3] } 

5 + (Z 2 ;4) = { [4,oo) } s -(Z 2 ;4) = { (-oo,4] } 

B+(/ 2 ;2) = {[2,4)} i?r(Z 2 ;2) = {(l,2]} 

£+(Z 2 ;3) = {[3,4)} i?r(Z 2 ;3) = {(i,3]} 
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Figure 13: f 3 : a linear map on a planar graph 



Example 4.9. Bar codes of f 3 in Figure [TBI 

B + (/ 3 ;l) = {[l,oc)} B (f 3 ;l) 

5 + (/ 3 ;2) = {[2,oo), [2,5)} B»(f 3 ;2) 

£ + (/ 3 ;3) = {[3,oo), [3,5), [3,5)} B^(f 3 ;3) 

5 + (/ 3 ;4) = {[4,oo), [4,5), [4,5)} 5 "(/ 3 ;4) 

S + (/ 3 ;5) = {[5,oo)} S -(/ 3 ;5) 



{(-oo,l]} 
{(-oo,2], (1,2]} 
{(-oo,3], (1,3], (2,3]} 
{(-oo,4], (1,4], (2,4]} 
{(-oo,5]} 




Figure 14: f A : a linear map on the union of the surfaces of two tetrahedra 



Example 4.10. Bar codes of / 4 in Figure [141 
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E + (/ 4 ;l) = {[l,oo)} 
£ + (/ 4 ;2) = {[2,oo), [2,3)} 
E + (/ 4 ;3) = {[3,oo)} 
^o + (/ 4 ;4) = {[4,oo), [4,oo)} 
5+(/ 4 ;5) = {[5,oo), [5,oo)} 
£ +(/ 4 ;6) = {[6,oo), [6,oo)} 
5 + (/ 4 ;7) = {[7,oo), [7,oo)} 
5 + (/ 4 ;8) = {[8,oo), [8,oo)} 
E + (/ 4 ;9) = {[9,oc)} 
S 1 + (/ 4 ;2) = {[2,8)} 
£+(/ 4 ;3) = {[3,8), [3,9)} 
B+(/ 4 ;4) = {[4,9), [4,8)} 
B+(/ 4 ;5) = {[5,8), [5,9)} 
5 1 + (/ 4 ;6) = {[6,9), [6,8)} 
5+(/ 4 ;7) = {[7,9), [7,8)} 
£+(/ 4 ;8) = {[8,9)} 



5 -(/ 4 ;l) = {(-oo,l]} 
J B -(/ 4 ;2) = {(-oo,2], (-oo,2]} 

W 4 ;3) = {(-oo,3]} 
5 -(/ 4 ;4) = {(-oo,4], (3,4]} 

S -(/ 4 ;5) = {(-oo,5], (3,5]} 

W 4 ;6) = {(-oo,6], (3,6]} 

S -(/ 4 ;7) = {(-oo,7], (3,7]} 

^o"(/ 4 ;8) = {(-oo,8], (3,8]} 

W 4 ;9) = {(-oo,9]} 
5r(/ 4 ;2) = {(l,2]} 
fir(/ 4 ;3) = {(l,3], (2,3]} 

flr(/ 4 ;4) = {(2,4], (1,4]} 
Sr(/ 4 ;5) = {(1,5], (2,5]} 
£f(/ 4 ;6) = {(2,6], (1,6]} 
flr(/ 4 ;7) = {(2,7], (1,7]} 
Sr(/ 4 ;8) = {(2,8]} 
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